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□ 1. Document ID: US 6773880 B2 

Ll: Entry 1 of 5 File: USPT Aug 10, 2 004 



DOCUMENT -IDENTIFIER: US 6773880 B2 

TITLE: Streptococcus pneumoniale 37-kDa surface adhesion A protein 



Detailed Description Text (73) : 

The complete nucleotide sequence on both strands of the 1.3-kb insert was 
determined by cycle sequencing of the plasmid subclone using oligonucleotide 
primers complementary to the sequence. These were made as sequence information 
became available. The nucleotide sequence of the entire streptococcal insert is set 
forth in the Sequence Listing as SEQ ID NO : 1 . The single open reading frame (ORF) 
present, beginning at nucleotide (nt) 189 and ending at nt 1117, encodes the psaA 
gene sequence. This ORF is 93 0 nt long and when amplified and subcloned into vector 
systems such as pGEM (Promega, Madison, Wis.) and BAC-to-BAC . sup . tm expression 
system (Bethesda Research Laboratories, Gaithersburg, Md.) expresses full-length 
PsaA, reactive with anti-PsaA MAb antibodies. This ORF encodes a peptide of 309 
amino acids with a deduced molecular weight of 34,598 and an isoelectric point of 
5.23. Analysis of the peptide using the algorithm of Kyte and Doolittle (Kyte et 
al. 1982. "A simple method for displaying the hydropathic character of a protein, . " 
""J. Mol. Biol. 157:105-132) shows that the peptide contains a major hydrophobic 
region of 20 amino acids which encodes a putative leader sequence. This leader 
^ contains the' consensus jtej^ejn^e Removal of 

this leader would result in a peptide of molecular mass 32,465 with a predicted 
isoelectric point of 4.97. A consensus sequence for a ribosomal binding site (Shine 
et al . 1974. "The 3 ' -terminal sequence of E. coli 16S ribosomal RNA: 
complementarity to nonsense triplets and ribosomal binding sites." Proc . Natl. 
Acad. Sci. USA 71:1324-1346) is located 5 nt upstream of the ATG start codon . 

D etailed Description Text (76) : 

A comparison of serotype 2 and 6B PsaAs shows almost complete identity: the 
computed similarity value is 99.3. The eight base difference at the nucleotide 
level translated into a difference at the peptide level of six amino acids with two 
of the changes resulting in conservative substitutions. Further analyses and 
comparisons of the serotype 6B sequence to the other five GenBank PsaA homologues 
from viridans Streptococci and E. faecalis (Fenno et al . 1989. "Nucleotide sequence 
analysis of a type 1 fimbrial gene of Streptococcus sanguis FW213." Infect. Immun. 
57:3527-3533, Sampson et al . 1994. "Cloning and nucleotide sequence analysis of 
psaA, the Streptococcus pneumoniae gene encoding a 37 -kilodalton protein homologous 
to previously reported Streptococcus sp. adhesins." Infect. Immun. 62:319-324, 
Ganeshkumar et al . 1991. "Nucleotide sequence of a gene coding for a salvia-binding 
protein (SsaB) from Streptococcus sanguis 12 and possible role of the protein in 
coaggregation with actinomyces . " Infect. Immun. 59:1093-1099, Kolenbrander et al . 
1994. "Nucleotide sequence of the Streptococcus gordonii PK488 coaggregation 
adhesin gene scaA and ATP-binding cassette." Infect. Immun. 62:4469-4480, and Lowe 
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All of the known Lral proteins are lipoproteins with the typical recognition 
sequence LxxC for the signal peptidase II at amino acid residue 16 or 17. The GBS 
homologue has a similar but slightly different sequence at this position (IAGC) 
with an exchange of leucine to isoleucine. LplA of Bacillus subtilis which has been 
shown to be a lipoprotein by radiolabelling with palmitate, is another example of 
this atypical recognition sequence in a gram-positive organism (Sutcliffe and 
Russel 1995) . 



Fun | Title I Citation | Front | Revising C lass tficatiori]D ate | Reference 



□ 4. Document ID: US 6217884 Bl 

Ll: Entry 4 of 5 File: USPT Apr 17, 2001 



DOCUMENT- IDENTIFIER: US 6217884 Bl 

TITLE: Streptococcus pneumoniae 3 7-kDa surface adhesin a protein 
Detailed Description Text (66) : 

The complete nucleotide sequence on both strands of the 1.3-kb insert was 
determined by cycle sequencing of the plasmid subclone using oligonucleotide 
primers complementary to the sequence.. These were made as sequence information 
became available. The nucleotide sequence of the entire streptococcal insert is set 
forth in the Sequence Listing as SEQ ID NO:l. The single open reading frame (ORF) 
present, beginning at nucleotide (nt) 189 and ending at nt 1117, encodes the psaA 
gene sequence. This ORF is 93 0 nt long and when amplified and subcloned into vector 
systems such as pGEM (Promega, Madison, Wis.) and BAC-to-BA. TM . expression system 
(Bethesda Research Laboratories, Gaithersburg, Md.) expresses full-length PsaA, 
reactive with anti-PsaA MAb antibodies. This ORF encodes a peptide of 309 amino 
acids with a deduced molecular weight of 34,598 and an isoelectric point of 5.23. 
Analysis of the peptide using the algorithm of Kyte and Doolittle (Kyte et al. 
1982. "A simple method for displaying the hydropathic character of a protein." J. 
Mol. Biol. 15 7:105-132) shows that the peptide contains a major hydrophobic region 
of 20 amino acids which encodes a putative leader sequence. This leader contains 
the consensus sequence for signal peptidase cleavage (L XXC ) . Removal of this leader 
would result in a peptide of molecular mass 32,465 with a predicted isoelectric 
point of 4.97. A consensus sequence for a ribosomal binding site (Shine et al. 
1974. "The 3' -terminal sequence of E. coli 16S ribosomal RNA: complementarity to 
nonsense triplets and ribosomal binding sites." Proc . Natl. Acad. Sci. USA 71:1324- 
1346) is located 5 nt upstream of the ATG start codon. 

Detailed Description Text (69) : 

A comparison of serotype 2 and 6B PsaAs shows almost complete identity: the 
computed similarity value is 99.3. The eight base difference at the nucleotide 
level translated into a difference at the peptide level of six amino acids with two 
of the changes resulting in conservative substitutions. Further analyses and 
comparisons of the serotype 6B sequence to the other five GenBank PsaA homologues 
from viridans Streptococci and E. faecalis (Fenno et al . 1989. "Nucleotide sequence 
analysis of a type 1 fimbrial gene of Streptococcus sanguis FW213." Infect. Immun. 
57:3527-3533, Sampson et al . 1994. "Cloning and nucleotide sequence analysis of 
psaA, the Streptococcus pneumoniae gene encoding a 37 -kilodalton protein homologous 
to previously reported Streptococcus sp . adhesins." Infect. Immun. 62:319-324, 
Ganeshkumar et al . 1991. "Nucleotide sequence of a gene coding for a salvia-binding 
protein (SsaB) from Streptococcus sanguis 12 and possible role of the protein in 
coaggregation with actinomyces . " Infect. Immun. 59:1093-1099, Kolenbrander et al . 
1994. "Nucleotide sequence of the Streptococcus gordonii PK488 coaggregation 
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adhesin gene scaA and ATP-binding cassette." Infect. Immun. 62:4469-4480, and Lowe 
et al . 1995. "Cloning of an Enterococcus faecalis endocarditis antigen: homology 
with some adhesins from oral streptococci." Infect. Immun 63:703-706) revealed 
significant sequence similarity between them. Sequence identities were 81%, 81%, 
77%, 82%, and 57%, respectively, for PsaA (S. pneumoniae strain R36A) , SsaB (S. 
sanguis), FimA (S. parasanguis) , ScaA (S. gordonii) and EfaA (E. faecalis). 
Additionally, all six sequences showed great similarity in organization. They have a 
a hydrophobic leader peptide containing the prolipoprotein consensus sequence LXXC 
(for signal peptidase II cleavage) within the first 17-20 amino acids. This N- 
terminal leader sequence appears to represent the area of greatest variability. It 
is followed by a region of high similarity from amino acids 36-150. The region from 
150 to 198 is a variable region and is followed by another conserved region (198- 
309) . 



DOCUMENT- IDENTIFIER: US 5854416 A 

TITLE: Streptococcus pneumoniae 37-KDA surface adhesin a protein and nucleic acids 
coding therefor 



Detailed Description Text (76) : 

The complete nucleotide sequence on both strands of the 1.3-kb insert was 
determined by cycle sequencing of the plasmid subclone using oligonucleotide 
primers complementary to the sequence. These were made as sequence information 
became available. The nucleotide sequence of the entire streptococcal insert is set 
forth in the Sequence Listing as SEQ ID NO : 1 . The single open reading frame (ORF) 
present, beginning at nucleotide (nt) 189 and ending at nt 1117, encodes the psaA 
gene sequence. This ORF is 930 nt long and when amplified and subcloned into vector 
systems such as pGEM (Promega, Madison, Wis.) and BAC-to-BAC . TM . expression system 
(Bethesda Research Laboratories, Gaithersburg, Md.) expresses full-length PsaA, 
reactive with anti-PsaA MAb antibodies. This ORF encodes a peptide of 309 amino 
acids with a deduced molecular weight of 34,598 and an isoelectric point of 5.23. 
Analysis of the peptide using the algorithm of Kyte and Doolittle (Kyte et al . 
1982. "A simple method for displaying the hydropathic character of a protein." J. 
Mol . Biol. 157:105-132) shows that the peptide contains a major hydrophobic region 
of 2 0 amino acids which encodes a putative leader sequence. This leader contains 
the consensus sequence for signal peptidase cleavage (LXXC) . Removal of this leader 
would result in a peptide of molecular mass 32,465 with a predicted isoelectric 
point of 4.97. A consensus sequence for a ribosomal binding site (Shine et al . 
1974. "The 3 1 -terminal sequence of E. coil 16S ribosomal RNA: complementarity to 
nonsense triplets and ribosomal binding sites." Proc . Natl. Acad. Sci . U.S.A. 
71:1324-1346) is located 5 nt upstream of the ATG start codon. 

Detailed Description Text (79) : 

A comparison of serotype 2 and 6B PsaAs shows almost complete identity: the 
computed similarity value is 99.3. The eight base difference at the nucleotide 
level translated into a difference at the peptide level of six amino acids with two 
of the changes resulting in conservative substitutions. Further analyses and 
comparisons of the serotype 6B sequence to the other five GenBank PsaA homologues 
from viridans Streptococci and E. faecalis (Fenno et al . 1989. "Nucleotide sequence 
analysis of a type 1 fimbrial gene of Streptococcus sanguis FW213." Infect. Immun. 
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57:3527-3533, Sampson et al . 1994. "Cloning and nucleotide sequence analysis of 
psaA, the Streptococcus pneumoniae gene encoding a 37-kilodalton protein homologous 
to previously reported Streptococcus sp. adhesins." Infect. Immun. 62:319-324, 
Ganeshkumar et al . 1991. "Nucleotide sequence of a gene coding for a salvia-binding 
protein (SsaB) from Streptococcus sanguis 12 and possible role of the protein in 
coaggregation with actinomyces . " Infect. Immun. 59:1093-1099, Kolenbrander et al . 
1994. "Nucleotide sequence of the Streptococcus gordonii PK488 coaggregation adhesin 
adhesin gene scaA and ATP-binding cassette." Infect. Immun. 62:4469-4480, and Lowe 
et al . 1995. "Cloning of an Enterococcus faecalis endocarditis antigen: homology 
with some adhesins from oral streptococci. " Infect. Immun 63:703-706) revealed 
significant sequence similarity between them. Sequence identities were 81%, 81%, 
77%, 82%, and 57%, respectively, for PsaA (S. pneumoniae strain R3 6A) , SsaB (S. 
sanguis), FimA (S. parasanguis) , ScaA (S. gordonii) and EfaA (E. faecalis). 
Additionally, all six sequences showed great similarity in organization. They have a 
a hydrophobic leader peptide containing the prolipoprotein consensus sequence LXXC 
(for signal peptidase II cleavage) within the first 17-20 amino acids. This N- 
terminal leader sequence appears to represent the area of greatest variability. It 
is followed by a region of high similarity from amino acids 36-150. The region from 
150 to 198 is a variable region and is followed by another conserved region (198- 
309) . 
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US-PAT-NO: 6773880 

DOCUMENT- IDENTIFIER: US 6773880 B2 

TITLE: Streptococcus pneumoniale 37-kDa surface adhesion A protein 
DATE -ISSUED: August 10, 2 004 



INVENTOR- INFORMATION : 
NAME 

Sampson; Jacquelyn 
Russell; Harold 
Tharpe; Jean A. 
Ades; Edwin W. 
Carlone; George M. 



CITY 

College Park 
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Atlanta 
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STATE ZIP CODE 

GA 
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COUNTRY 



ASS IGNEE - INFORMATION : 
NAME 

The United States of America as 
represented by the Department of 
Health and Human Services 

APPL-NO: 09/ 754809 [PALM] 
DATE FILED: January 3, 2 0 01 



CITY 



STATE ZIP CODE COUNTRY TYPE CODE 



Washington DC 



06 



PARENT -CASE: 

This application is a divisional of and claims the benefit of priority from, U.S. 
Ser. No. 09/221,753, filed Dec. 28, 1998 now U.S. Pat. No. 6,217,884, which is a 
divisional of U.S. Ser. No. 08/715,131, filed Sep. 17, 1996, now U.S. Pat. No. 
5,854,416, which is a continuation-in-part of U.S. Ser. No. 08/222,179, filed Apr. 
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incorporated herein by reference. 
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FOREIGN PATENT DOCUMENTS 



FOREIGN- PAT-NO PUBN-DATE COUNTRY US-CL 

0206 852 December 1986 EP 
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OTHER PUBLICATIONS 



Sampson, et al . , "Immonulogic Characterization of a Monoclonal Antibody to 
Streptococcus Pneumoniae Pneumococcal Surface Adhesin A (PsaA) Protein, " Centers for 
for Disease Control & Prevention, Atlanta, Ga 30333. 

Novak, et al . , "Penicillin tolerance genes of Streptococcus pneumoniae: the ABC-type 
type manganese permease complex Psa, " Molecular Microbiology, 29:1285-1296 (1998). 
Lawrence, et al . , "The crystal structure of pneumococcal surface antigen PsaA 
reveals a metal -binding site and a novel structure for a putative ABC- type binding 
protein," Structure, 6:1553-1561, (Dec. 1998). 

Amit, et al . , "Three-Dimensional Structure of a Antigen-Antibody Complex at 2.8 
Angstrom Resolution, " Science, 233:747-753 (Apr. 1986). 

Kolenbrander, et al . , "Nucleotide Sequence of the Streptococcus gordonii PK488 
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Infection and Immunity, 59 (4 ) : 1285 - 12 89 , (Apr. 1991). 
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of Streptococcus pneumoniae," Abstracts of the 91.sup.st General Meeting of the Am. 
Society for Microbiology Annual Meeting; p. 97, (1991) . 

Russell, et al., "Assay for Antibodies Against a Species -Specif ic Streptococcus 
Pneumoniae Antigen in Patients with Pneumococcal Disease," Abstracts of the 
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ATT Y- AGENT -FIRM: Needle & Rosenberg, P.C. 
ABSTRACT : 

The invention provides a nucleic acid encoding the 37-kDa protein from Streptococcus 
Streptococcus pneumoniae. Also provided are isolated nucleic acids comprising a 
unique fragment of at least 10 nucleotides of the 37-kDa protein. The invention also 
also provides purified polypeptides encoded by the nucleic acid encoding the 37-kDa 
protein from and the nucleic acids comprising a unique fragment of at least 10 
nucleotides of the 37-kDa protein. Also provided are antibodies which selectively 
binds the polypeptides encoded by the nucleic acid encoding the 37-kDa protein and 
the nucleic acids comprising a unique fragment of at least 10 nucleotides of the 37- 
37-kDa protein. Also provided are vaccines comprising immunogenic polypeptides 
encoded by the nucleic acid encoding the 37-kDa protein and the nucleic acids 
comprising a unique fragment of at least 10 nucleotides of the 37-kDa protein. 
Further provided is a method of detecting the presence of Streptococcus pneumoniae 
in a sample comprising the steps of contacting a sample suspected of containing 
Streptococcus pneumoniae with nucleic acid primers capable of hybridizing to a 
nucleic acid comprising a portion of the nucleic acid encoding the 37-kDa protein, 
amplifying the nucleic acid and detecting the presence of an amplification product, 
the presence of the amplification product indicating the presence of Streptococcus 
pneumoniae in the sample. Further provided are methods of detecting the presence of 
Streptococcus pneumoniae in a sample using antibodies or antigens, methods of 
preventing and treating Streptococcus pneumoniae infection in a subject. 

1 Claims, 0 Drawing figures 
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PGPUB- FILING- TYPE: new 
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COUNTRY x RULE -4 7 

BE 
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1. An immunogenic composition comprising at least 2 S. pnumoniae proteins wherein 
one of the proteins is selected from the polyhistidine triad family (PhtX) and 
another protein is selected from the group consisting of Choline Binding Protein 
family (CbpX), CbpX truncates, LyX family, LytX truncates, CbpX truncate -LytX 
truncate chimeric proteins, pneumolysin (Ply), PspA, PsaA, Spl28, SplOl, Spl30, 
Spl25 and Spl33 . 

2. An immunogenic composition comprising at least 2 S. pneumoniae proteins wherein 
one of the proteins is selected from the group consisting of Choline Binding 
Protein family (CbpX) , CbpX truncates, and CbpX truncate-LytX truncate chimeric 
proteins and another protein selected from the group consisting of polyhistidine 
triad family (PhtX), LytC, pneumolysin (Ply), PsaA, and Spl28. 

3. The immunogenic composition of claims 1-2 wherein Pht is PhtA, PhtB or PhtD . 

4. The immunogenic composition of claims 1-3 wherein CbpX is CbpA or PspC. 

5. The immunogenic composition of claims 1-4 additionally comprising an adjuvant. 

6. A vaccine comprising the immmogenic composition of claim 5. 

7. A method of eliciting an immune response by immunising a mammal with the 
immunogenic composition of claims 1-5. 

8 . A method of preventing or ameliorating Streptococcus infection in patients over 
55 years of age, comprising administering a safe and effective amount of the 
vaccine of claim 6 to said patients. 
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9. Use of the vaccine of claim 6 in the manufacture of a medicament for prevention 
of pneumonia in patients over 55 years of age. 

10. A method of preventing or ameliorating Otitis media in infants, comprising 
administering a safe and effective amount of the vaccine of claim 6 to said 
patients . 

11. A method of making a vaccine as claimed in claim 6 comprising the steps of: 
selecting and isolating two different S. pneumonia proteins; and mixing said 
proteins together with a pharmaceutically acceptable carrier. 
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ABSTRACT : 



The present invention relates to a combination of 2 or more S pneumoniae proteins, 
their manufacture and use in medicine as a vaccine. Such combinations are 
particularly useful for the protection of infants and elderly against streptococcal 
infection. 
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tr Q9 7QM9 Conserved domain protein [SP1174] [Streptococcus pneumoniae] 819 AA 

al ign 

Score = 423 bits (1087) , Expect = e-116 

Identities = 211/357 (59%) , Positives = 271/357 (75%) , Gaps = 11/357 (3%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQV 5 9 

MK +KKY+A GS +++LS+C+Y L ++++ Q+ K++NRV+Y+DG Q+ QK+ENLTPD+V 
Sbjct : 1 MKINKKYLA-GSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQKAENLTPDEV 5 9 

Query : 60 SQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIV 119 

S++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIV 
Sbjct : 60 SKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIV 119 

Query: 120 NEVKGGYIIKA/DGKYYVTLKI)AAHAD^ 178 

NE+KGGY+IKV+GKYYVYLKDAAHADN+RTK+EI RQKQE + N + ++ VA AR+QG 
Sbjct: 120 NE I KGG YVI KV1STGKY YVYLKDAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQG 179 

Query: 17 9 RYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQ 238 

RYTT+DGY+FN +DI IEDTG+AYIVPHG HYHYIP Q S+ 

Sbjct: 180 RYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIP- -KNELSASELAAAEAYWNGKQGSR 237 



Query: 23 9 LSYSSTASDNNTQ SVAKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVF 2 92 

S SS+ + N Q S TP N+ EN+ SLL+ELY P ++R+ ESDGL+F 

Sbjct: 238 PSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIF 297 

Query: 293 DPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 349 

DPA+I SRT GVA+PHG+HYHFIPY + + S LE++IAR +P+ + +++P E 

Sbjct: 298 DPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEE 354 
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Welcome to the SIB BLAST Network Service 

If results of this search are reported or published, please mention that 
the computation was performed at the SIB using the BLAST network service. 
The SIB BLAST network service uses a server developed at SIB and the NCBI 
BLAST 2 software. 

In case of problems, please read the online BLAST help . 

If your question is not covered, please contact < helpdesk@expasy . org > . 

NCBI BLAST program reference [PMID : 9254694 ] : 

Altschul S.F., Madden T.L., Schaffer A. A. , Zhang J., Zhang Z., Miller W., 
Lipman D.J. Gapped BLAST and PSI -BLAST: a new generation of protein 
database search programs. Nucleic Acids Res. 25:3389-3402(1997). 



Query length: 1039 AA (of which 3% low- complexity regions filtered out) 
Date run: 2004-09-27 14:41:04 UTC+0100 on sib-gml . unil . ch 
Program: NCBI BLASTP 1 . 5 . 4 -Paracel [2003-06-05] 
Database : EXPASY/UniProt 

1,569,420 sequences; 502,311,625 total letters 



List of potentially matching sequences 

Send selected sequences to jClustal W (multiple alignment) Ml l^B^jSUl^ 
f~ Include query sequence 
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Splice isoform Skeletal 3 of Q28820 [TRDN] [Oryct... 
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Q9VGH4 
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Alignments 



tr Q.. 8 .QQ07 Pneumococcal histidine triad protein E [phtE] [Streptococcus 103 9 AA 
pneumoniae (strain ATCC BAA-255 / R6) ] 

align 

Score = 2017 bits (5225), Expect =0.0 

Identities = 1004/1039 (96%) , Positives = 1004/1039 (96%) 

Query : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 6 0 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbjct : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGI QAEQ I VI KI TDQG YVTSHGDHYHY YNGKVP YDALFSEELLMKDPNYQLKDAD I VN 120 

QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 
Sbjct: 61 QKEG I QAEQ I VI KI TDQGYVTSHGDHYHY YNGKVPYDALFSEELLMKDPNYQLKDAD I VN 12 0 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN^ 18 0 

EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 
Sbjct: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

Query : 181 TTNDGYVFNPADI IEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 24 0 

TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIP NMQPSQLS 
Sbjct : 181 TTNDGYVFNPADI IEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKMMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query : 3 01 TPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVPI SGTGSTVSTNAKPNE WXXXXXXXXX 36 0 

TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEW 
Sbjct: 301 TPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WS S LGS LS SN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

KELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

Query: 4 81 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 54 0 

AQKHLEE VKTSHNGLDSLS SHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRES IWN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query : 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVA.KKEGNKVYTGEELTNV 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 6 01 WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 66 0 

VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct: 601 VNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbj ct : 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 
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Query: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

GDT YLRVNPQF AVP KGTDALVRVFDE FHGNAYLENNYKVGE IKLP I PKLNQGTTRT AGNK 
Sbjct : 721 GDTYLRWPQFAVPKGTDALVRVFDE FHGNAYLENNYKVGE IKLP I PKLNQGTTRTAGNK 78 0 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
Sbjct : 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct : 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 1039 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 1039 



tr 25MQZ Surface protein BVH-3 [bvh-3] [Streptococcus 1039 AA 

pneumoniae] align 

Score = 2017 bits (5225), Expect =0.0 

Identities = 1004/1039 (96%), Positives = 1004/1039 (96%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbjct: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 18 0 

E VKGGYI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNEKVWSNVAVARS QGR Y 
Sbjct: 121 E VKGGYI I KVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 180 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 24 0 

TTNDGYVFNPAD I IEDTGNAYI VPHGGHYHYI P NMQPSQLS 
Sbjct: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLS 240 

Query; 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXX 360 

TPNGVAI PHGDHYHF I PYSKLSALEEKIARRVPI SGTGSTVSTNAKPNEW 
Sbjct: 301 TPNGVAI PHGDHYHFIPYSKLSALEEKIARRVP I SGTGSTVSTNAKPNEWSSLGSLSSN 360 

Query : 3 61 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 42 0 
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KELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 4 80 

TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query : 541 KEKNAI IYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 6 0 0 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
Sbjct : 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 WLLKNSTFNNQNFTLANGQK^VSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

VNLLKNSTFNWQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct: 601 WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 660 

Query: 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbjct : 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

Query: 721 GDT YLRVNPQFAVPKGTDAL VR VFDE FHGNAYLENN YKVGE I KL PIP KLNQGTTRTAGNK 780 

GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 
Sbjct : 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 840 

IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 
Sbjct : 781 I PVTFMANAYLDNQST YI VE VP I LEKENQTDKPS I LPQFKRNKAQENS KLDEKVEEPKTS 84 0 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 90 0 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
Sbjct : 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

Query : 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 96 0 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct : 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 102 0 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 
Sbj ct : 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 103 9 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 103 9 



tr Q9ANY1 Pneumococcal histidine triad protein E precursor 
(Hypothetical 

protein SP1004) [phtE] [Streptococcus pneumoniae] 



Score = 2006 bits (5196), Expect = 0.0 

Identities = 998/1039 (96%), Positives = 1000/1039 (96%) 
Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 



1039 
AA 

align 
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MKPSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 
Sbj ct : 1 MKFSKKYIAAGSAVIVSLSLCAYAIiNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 
Sbjct: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

Query: 121 EVKGK3YIIKVDGKYYVYLKDAAHADNVRTOT 180 

EVKGGYI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRY 
Sbj ct: 121 EVKGGYIIK^GKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRY 18 0 

Query: 181 TTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS 24 0 

TTNDG Y VFNPAD I I EDTGNAY I VPHGGH YH Y I P NMQ PSQLS 

Sbjct: 181 TTNDGYVFNPAD 1 1 EDTGNAYI VPHGGHYH YI PKSDLS AS ELAAAKAHLAGKNMQPSQLS 240 

Query: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 3 00 

YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 
Sbjct: 241 YSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 300 

Query: 301 TPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXX 360 

TPNGVAI PHGDHYHF1 PYSKLSALEEKI AR VPISGTGSTVSTNAKPNEVV 
Sbjct: 301 TPNGVAI PHGDH YHF I PYSKLSALEEKI ARMVP I SGTGS TVSTNAKPNE WS SLGSLS SN 360 

Query: 361 XXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

KELSSASDGY1FNPK.DIVEETATAY1VRHGDHFHYIPKSNQ1GQPTLPNNSLA 
Sbjct: 361 PSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA 420 

Query: 421 TPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 4 80 

TPSPSLPINPG SHEKHEEDGYGFDANRIIAEDESGF+MSHG+HNHYFFKKDLTEEQIKA 
Sbjct: 421 TPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKA 480 

Query: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

AQKHLEEVKTSHNGLDSLSSHEQDYP NAKEMKDLDKKIEEKIAGIMKQYGVKRES IWN 
Sbjct: 481 AQKHLEEVKTSHNGLDSLSSHEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

Query: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 6 00 

KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 
Sbjct: 541 KEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNV 600 

Query: 601 VNLLKNS TFNNQNFTLANGQKRVS FS FPPELEKKLG INMLVKL I TPDGKVLEKVSGKVFG 660 

WLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFG 
Sbjct : 601 VNLLKNS TFNNQNFTLANGQKRVS FS FPPELEKKLG INMLVKL I TPDGKVLEKVSGKVFG 660 

Query : 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 72 0 

EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 
Sbjct : 661 EGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHA 720 

Query : 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 
Sbjct: 721 GDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNK 780 

Query: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTS 84 0 

IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQEN KLDEKVEEPKTS 
Sbjct: 781 IPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENLKLDEKVEEPKTS 840 

Query: 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 

EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 
Sbjct : 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 900 
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Query: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 
Sbjct: 901 SGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP 960 

Query: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 
Sbjct: 961 ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTI 1020 

Query: 1021 ELRLPSGEVIKKNLSDLIA 103 9 

ELRLPSGEVIKKNLSDLIA 
Sbjct: 1021 ELRLPSGEVIKKNLSDLIA 103 9 



tr Q6WNQ5 Surface protein BVH-3 (Fragment) [bvh-3] [Streptococcus 1019 
pneumoniae] ^ 

al ign 

Score = 1968 bits (5099), Expect =0.0 

Identities = 977/1019 (95%), Positives = 979/1019 (95%) 

Query: 21 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 8 0 

CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKI'TDQGYV 
Sbjct: 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60 

Query: 81 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKD 14 0 

TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 
Sbjct: 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 12 0 

Query: 141 AAHADNVRTKDEINRQKQEHVKDNEKWSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 2 00 

AAHADNVRTKDE INRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPAD I IEDTGNA 
Sbjct: 121 AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 180 

Query: 201 YIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSK 260 

YIVPH GHYHYIP NMQPSQLSYSSTASDMNTQSVAKGSTSK 
Sbjct: 181 YIVPHRGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 240 

Query: 261 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 320 

PANKS ENLQSLLKEL YDS PS AQRYS ESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I P YSK 
Sbjct: 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 300 

Query: 321 LSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFN 3 80 

LSALEEKIAR VPISGTGSTVSTNAKPNEW KELSSASDGYIFN 
Sbjct: 301 LS ALEEKI ARMVP I S GTGS TVS TNAKPNEVVSSLGSLSSNPSSLTTS KELSSASDGYIFN 360 

Query: 3 81 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEED 44 0 

PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG SHEKHEED 
Sbjct: 361 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420 

Query: 441 GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 500 

GYGFDANRIIAEDESGF+MSHG+HNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 
Sbjct: 421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480 

Query: 501 HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 560 

HEQDYP NAKEMKDLDKKIEEKI AGIMKQYGVKRES I VVNKEKNAI I YPHGDHHHADP ID 
Sbjct: 481 HEQDYPSNAKEMKDLDKKI EE KI AGIMKQYGVKRES IWNKEKNAI I YPHGDHHHADP ID 540 
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Query: 561 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTLANGQ 620 

EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNVWLLKNSTFNNQNFTLANGQ 
Sbjct: 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQNFTLANGQ 600 

Query: 621 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 680 

KRVSFSFPPELEKKLGIWMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 
Sbjct: 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 

Query: 681 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 740 

FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 
Sbjct: 661 FKYTIASKDYPEVSYDGTFTVPTSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDAL 72 0 

Query: 741 VRVFDE FHGNAYLENNYKVGE I KL P I PKLNQGTTRTAGNKI P VTFMANAYLDNQS T Y I VE 800 

VRVFDEFHGNAYLENNYKVGEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVE 
Sbjct: 721 VRVFDE FHGNAYLENNYKVGE I KLP I PKLNQGTTRTAGNKI PVTFMANAYLDNQS T YI VE 780 

Query: 801 VPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 8 60 

VPILEKENQTDKPSILPQFKRNKAQENSK DEKVEEPKTSEKVEKEKLSETGNSTSNSTL 
Sbjct: 781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

Query: 861 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 92 0 

EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 
Sbj ct : 841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGN 900 

Query: 921 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 98 0 

GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 
Sbj Ct : 901 GENKP S ENGKVS TGT VENQPTENKP AD S L PE APNE KP VKPENS TDNGMLNPEGNVGSD PM 960 

Query: 981 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1039 

LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 
Sbjct: 961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDLIA 1019 



tr Q8CWR4 Histidine Motif -Containing protein [phpA] [Streptococcus 855 
pneumoniae pj± 
(strain ATCC BAA-255 / R6)] a ij qn 



Score = 442 bits (1137), Expect = e-122 

Identities = 219/369 (59%), Positives = 271/369 (73%), Gaps = 21/369 (5%) 

Query : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+A A +V LS+CAY L H++Q K+NNRVSY+DG Q++QK+ENLTPD+VS 
Sbjct : 12 MKINKKYLAGSVATLV-LSVCAYELGLHQAQTVKENNRVSYIDGKQATQKTENLTPDEVS 7 0 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

++EGI AEQ I VI KI TDQG YVTSHGDH YHYYNGKVP YD A+ SEELLMKDPNYQLKD DI++ 
Sbjct: 71 KREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAI ISEELLMKDPNYQLKDEDI IS 130 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNE KVNS NVAVAR S 176 

E + KGG Y-f- 1 KVDGKY YVYLKDAAHADNVRTK+E INRQKQEH + E + + VA+ARS 

Sbjct: 131 E I KGG YVI KVDGKY YVYLKDAAHADNVRTKEE INRQKQEHSQHREGGTPRNDGAVALARS 190 

Query: 177 QGR YTTNDG YVFNP AD I I EDTGNAY I VPHGGH YH Y I PXXXXXXXXXXXXXXXXXXXNMQ P 236 

QGRYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP 
Sbjct: 191 QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAKAFLSGRGNLS 250 
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Query: 237 SQLSYSSTASDN NTQSVAKGSTSKPANKSENLQSLLKELYDSPS 280 

+ +Y SDN NT + +T+ A++S + + SLLK+LY P 

Sbjct: 251 NSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPL 310 

Query: 281 AQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGS T 340 

+QR+ ESDGL+FDPA+I SRT NGVA-f PHGDHYHF IP YS+LS LEEK+AR +P+ + 
Sbjct: 311 SQRHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNH 370 

Query: 341 VSTNAKPNE 34 9 

+++P + 

Sbjct: 371 WVPDSRPEQ 379 
Score =84.0 bits (206), Expect = le-14 

Identities = 111/511 (21%), Positives = 190/511 (36%), Gaps = 70/511 (13%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEEL LMKDPNYQLKDADI 118 

+1+ T Y+ HGDHYHY +P + L + EL L Y+ + ++D 

Sbjct: 207 I IEDTGDAYI VPHGDHYHY 1 PKNEL S AS EL AAAKAFLS GRGNLSNS RT YRRQNS D - 261 

Query: 119 WEVKGGYI I KVDGKYYVYLKDAAHADNVRTKDEINRQKQ - EHVKDNEKVNSNVAVARSQ 177 

N + ++V NT+NQ D + + + 

Sbjct: 262 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLS 311 

Query: 178 GRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXX 232 

R+ +DG 4-F+PA I T N VPHG HYH+IP 
Sbjct: 312 QRHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHW 371 

Query: 233 - -NMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGL 290 

+ +P Q S ST + + A P+N + + L+KE E +G+ 

Sbjct: 372 VPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPID- -EKLVKEAVRKVGDGYVFEENGV 429 

Query: 291 V-FDPAKI IS -RTPNGVAI PHGDHYHF I PYSKLS ALEEKI ARRVPISGTGS TVS TNAKPN 348 

+ PAK +S T G+ SKL A +E ++ T S N 

Sbjct: 430 PRY I P AKD L S AE T AAG I D SKL-AKQESLSHKLGAKKTDLPSSDREFYN 476 

Query: 349 EWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQ 408 

+ + A D + KD+ + D + + 

Sbjct: 477 KAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLV DDILAFLAPIRH 532 

Query: 409 IGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYF 468 

+ PN + + + ++ + EDGY FD I + + + + + H H+H+ 

Sbjct: 533 PERLGKPNAQ ITYTDDE IQVAK - LAGKYTTEDGY I FDPRD I TSDEGDAYVTPHMTHSHW I 591 

Query: 469 FKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIA 524 

K L+E + AAQ + +E GL S+ QD GN AK + + + + + +K+ 

Sbjct: 592 KKD S L S E AERAAAQ A Y AKE KGLTPPSTDHQD - SGNTEAKGAEAI YNRVKAAKKVP 645 

Query: 525 GIMKQYGVKRESIWNKEKNAIIYPHGDHHH 555 

Y ++ V + ++I PH DH+H 

Sbjct: 646 LDRMPYNLQ YTVEVKNGSLI I PHYDHYH 673 

Score = 68.2 bits (165), Expect = 8e-10 

Identities = 49/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 
Query: 138 LKDAAHADNVRTKDE IN RQKQEHVKDNE KV NSNVAVARS QGR YTTNDGYVF 188 
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LKD + +D V+ D + I R + K N + + + + VA+ G+YTT DGY+F 

Sbjct: 509 LKDVS -SDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIF 567 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 568 D PRD I T S D E GD A YVT PHMTHS H W I KKD S L S E AERAAAQ A Y AKE KGL T P PSTDHQD 622 

Query: 24 9 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

+ + AKG+ + +Y+ A + D + + + + + IP 

Sbjct: 623 SGNTEAKGA EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIP 667 

Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 668 HYDHYHNIKF 677 

Score = 63.9 bits (154), Expect = 2e-08 

Identities = 74/365 (20%), Positives = 139/365 (37%), Gaps = 85/365 (23%) 

Query: 23 9 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 2 98 

L + T +NN S G + K+ENL + S + +G + + +1 

Sbjct: 37 LHQAQTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 82 

Query: 299 SRTPNGVAIPHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXX 354 

T G HGDHYH+ +PY + + E + + + P + K +4-4- 
Sbjct: 83 KITDQGYVTSHGDHYHYYNGKVPYDAI - - ISEELLMKDP NYQLKDEDII 129 

Query: 355 XXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYI - - VRHGDHFHYIPKSNQIGQP 412 

S GY+ V+ Y+ H D+ 4- N+ Q 

Sbjct: 130 SEIKGGYVIK VDGKYYVYLKDAAHADNVRTKEE INRQKQE 169 

Query: 413 TLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMSHGNHNHYFFKK 471 

+ TP + S ++ +DGY F+A+ II + 4-1+ HG4-H HY K 

Sbjct: 170 HSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKN 229 

Query: 472 DLTEEQIKAAQKHL EEVKTSHNGLDSLS SHEQDYP 506 

+L+ 4-4- AA+ L + + N + S + S S+ + 

Sbjct: 230 ELSASELAAAKAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTN 289 

Query: 507 GNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKE KNA 1 1 Y PHGDHHHAD P I D 560 

A + D+D + + 4- +QV+ + 4- 4-4-4- N 4- PHGDH+H P 

Sbjct: 290 SQASQSNDIDSLLKQLYKLPLSQRHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPYS 349 

Query: 561 EHKPV 565 

+ P+ 

Sbjct: 350 QLSPL 354 
Score =42.7 bits (99), Expect = 0.036 

Identities = 44/186 (23%), Positives = 63/186 (33%), Gaps = 36/186 (19%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

4- DGYIF4-P+DI + AY+ H H H+I K + Q L PS 

Sbjct: 560 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 619 

Query: 426 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+ G + K E D++ + I+H+HH 

Sbjct: 62 0 HQ - DSGNTEAKGAEAI YNRVKAAKKVPLDRMP YNLQYTVE VKNGSL I I PHYDHYHNI KFE 678 
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Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K+E+AK HE SNG+SHQ + + 

Sbjct: 679 WFDEGLYEAPKGYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 73 8 

Query: 517 KKIEEK 522 

K EEK 
Sbjct: 73 9 KPNEEK 744 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP I S + PH H H+I LS E A+ 
Sbjct: 549 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 608 

Query: 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + d +N + V 

Sbjct: 609 KE KGLTPPS TDHQD S GNTE AKGAE AI YNRVKAAK KVPLDRMPYNLQYTV 657 

Query: 386 EETATAYIVRHGDHFHYI 403 

E + 1+ H DH+H I 

Sbjct: 658 E VKNGS L 1 1 PH YDHYHN I 675 



tr QIOTQ? Pneumococcal histidine triad protein A [phtA] [Streptococcus 82 8 AA 
pneumoniae (strain ATCC BAA-255 / R6)] 

align 

Score = 437 bits (1125), Expect = e-121 

Identities = 218/369 (59%), Positives = 271/369 (73%), Gaps - 21/369 (5%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+ GSA + LS+C+Y L + + + + K+NNRVSY+DG Q+ + QK+ENLTPD+VS 
Sbjct: 13 MKINKKYLV-GSAAALILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVS 71 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 120 

++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+FSEELLMKDPNY+LKD DIVN 
Sbjct: 72 KREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVN 131 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVICDNE KVNS NVAVAR S 176 

EVKGGY+ IKVDGKYYVYLKDAAHADNVRT.K+E INRQKQEH 4- E + + VA+ARS 

Sbjct: 132 E VKGGYVI KVDGKYYVYLKDAAHADNVRTKEE INRQKQEHS QHREGGTPRNDGAVALARS 191 

Query: 177 QGRYTTNDGYVFNPAD I IEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQP 236 

QGRYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP 
Sbjct: 192 QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLS 251 

Query: 237 SQLSYSSTASDN NTQSVAKGSTSKPANKSENLQSLLKELYDSPS 280 

+ +Y S DN NT + +T+ A++S ++ SLLK+LY P 

Sbjct: 252 NSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPL 311 

Query: 281 AQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGST 340 

+ QR+ ESDGLVFDPA+I SRT GVA+ PHGDH YHF I P YS + + S LEE + IAR +P+ + 
Sbjct: 312 SQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNH 371 
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Query: 341 VSTNAKPNE 34 9 

+++P + 

Sbjct: 372 WVPDSRPEQ 3 80 



Score =77.8 bits (190), Expect = le-12 

Identities = 103/504 (20%), Positives = 189/504 (37%), Gaps = 60/504 (11%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEEL LMKDPNYQLKDADI 118 

+ 1+ T Y+ HGDHYHY + P + L 4- EL L Y+ 4 + 4-D 

Sbjct: 208 I IEDTGDAYIVPHGDHYHY IPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSD- 262 

Query: 119 WEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEIN^ 177 

N + ++V NT+NQ D 4 + + 

Sbjct: 263 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLS 312 

Query: 178 GRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPS 237 

R+ +DG VF+PA I T VPHG HYH+IP 4- P 

Sbjct: 313 QRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIA RIIPL 365 

Query: 23 8 QLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSE - SDGLVFDPAK 2 96 

+ + D-f + + T +P+ + +L + S 4-Q 4- 4-G VF+ K 

Sbjct: 366 RYRSNHWVPDSRPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEE-K 424 

Query: 297 IISRTPNGVAIPHGDHYHF 1 PYSKLSALEEKI ARRVPI SGTGSTVSTNAKPNE WXX 353 

ISR Y F +P + LE K+44-4- + S T + N P + 

Sbjct: 425 GISR YVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFY 472 

Query: 354 XXXXXXXXXXXXXXKELSSASDGYIFNPK- -DIVEETATAYIVRHGDHFHYIPKSNQIGQ 411 

E + + k 4- 4- + 4-T D +4- 4- 

Sbjct: 473 DKAYNLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAPITHPER 532 

Query: 412 PTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKK 4 71 

PN+ + + I 4- + 4- DGY FD + II + 4-4- + + H H4-H+ K 

Sbjct: 533 LGKPNSQIEYTEDEVRIAQ-LADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKD 591 

Query: 4 72 DLTEEQ I KAAQKHLEE VKTSHNGLDSLS SHEQDYPGNAKEMKDLDKKI EEKI AG IMKQYG 531 

L + 44-4- AAQ + +E D + + + G+4- K E + + I + Y 

Sbjct: 592 SLSDKEKVAAQAYTKEKGILPPSPD--ADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYM 649 

Query: 53 2 VKRES I WNKEKNAI I YPHGDHHH 555 

V+ V + +1 PH DH+H 

Sbjct: 650 VEH TVE VKNGNL I I PHKDH YH 670 



Score = 60.1 bits (144), Expect = 2e-07 

Identities = 73/372 (19%) , Positives = 131/372 (34%) , Gaps = 107/372 (28%) 

Query: 239 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 298 

L + T +NN S G 4- K+ENL + S+ 4-G+ + +1 

Sbjct: 38 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 83 

Query: 299 SRTPNGVAIPHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WXXX 354 



T G HGDHYH+ +PY + + 
Sbjct: 84 KITDQGYVTSHGDHYHYYNGKVPYDAIFS 112 

Query: 355 XXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR HGDHFH 401 
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+JSL Y +DIV E Y+ + + H D+ 

Sbjct: 113 EELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVR 159 

Query: 402 YIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMS 460 

+ N+ Q + TP + ■ s ++ +DGY F+A+ II + +1+ 

Sbjct: 160 TKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVP 219 

Query: 461 HGNHNHYFFKKDLTEEQIKAAQKHL EEVKTSHNGLDSLS 499 

HG+H HY K +L+ ++ AA+ L + + N + S + S 

Sbjct: 220 HGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTT 279 

Query: 500 SHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIV VNKEKNAIIYP 54 9 

S+ + A + D+D + + + + Q V+ + +V + p 

Sbjct: 280 NTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPAQITSRTARGVAVP 33 9 

Query: 550 HGDHHHAD P I D E 561 

HGDH+H P + 
Sbjct: 340 HGDHYHFI P YSQ 351 



Score =57.0 bits (136), Expect = 2e-06 

Identities = 36/155 (23%), Positives = 57/155 (36%), Gaps = 20/155 (12%) 

Query: 165 EKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXX 224 

E V +A4- +YTT+DGY+F+ DII D G+AY-f PH GH H+I 

Sbjct: 541 EYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKDSLSDKEKVA 600 

Query: 225 XXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRY 2 84 

+ P A+ + A +Y+ + + 

Sbjct: 601 AQAYTKEKG I LPPS PDADVKANPTGDS AAA IYNRVKGEKR 640 

Query: 2 85 SESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYS 319 

L + + + IPH DHYH I ++ 

Sbjct: 641 I PLVRLP YMVEHTVEVKNGNL I I PHKDHYHNI KFA 675 



tr Q 9 AG 7 4 PhpA [phpA] [Streptococcus pneumoniae] 844 AA 

align 

Score = 437 bits (1124) , Expect = e-121 

Identities = 216/369 (58%), Positives = 271/369 (72%), Gaps = 21/369 (5%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+ GSA + LS + C+Y L + + + + K-f NNRVSY+DG Q++QK+ENLTPD+VS 
Sbjct : 1 MKINKKYLV-GSAAALILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVS 59 

Query: 61 QKEG I QAEQI VI KI TDQG YVTSHGDHYH YYNGKVP YDALFSEELLMKDPNYQLKDAD I VN 12 0 

++EGI AEQIVIKI TDQG YVTSHGDHYH YYNGKVP YDA+ SEELLMKDPNYQLKD DI + + 
Sbjct: 60 KREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDEDIIS 119 

Query: 121 E VKGG YI I KVDGKY YVYLKDAAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARS 176 

E+KGGY+IKVDGKYYVYLKDAAHADNVRTK+EINRQKQEH + E + + VA+ARS 

Sbjct: 12 0 E I KGG YV I KVDGKY YVYLKDAAHADNVRTKEE INRQKQEHSQHREGGTPRNDGAVALARS 179 

Query: 177 QGR YTTNDG YVFNP AD I I EDTGNAY I VPHGGH YH Y I PXXXXXXXXXXXXXXXXXXXNMQP 236 

QGRYTT+DGY+FN +DI IEDTG+AYI VPHG HYHYIP 
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Sbjct: 180 QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAKAFLSGRGNLS 23 9 

Query: 237 SQLSYSSTASDN NTQSVAKGSTSKPANKSENLQSLLKELYDSPS 280 

+ +Y SDN NT + +T+ A++S ++ SLLK+LY P 

Sbjct: 240 NSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKX.pl 299 

Query: 281 AQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGST 340 

+QR~f- ESDGL+FDPA+I SRT NGVA+PHGDHYHFIPYS+LS LEEK+AR +P+ + 
Sbjct: 300 SQRHVESDGL I FDPAQ I TSRTANGVAVPHGDHYHF I P YS QLS PLEEKLARI I PLRYRSNH 359 

Query: 341 VSTNAKPNE 34 9 

+ + + P + 

Sbjct: 3 60 WVPDSRPEQ 3 68 
Score = 84.0 bits (206), Expect = le-14 

Identities = 111/511 (21%), Positives = 190/511 (36%), Gaps = 70/511 (13%) 

Query: 71 V I K I TD QG Y VTS HGDH YH Y YNG KVP YD AL F S E E L LMKDPNYQLKDADI 118 

+1+ T Y+ HGDHYHY +P + L + EL L Y+ +++D 

Sbjct: 196 I IEDTGDAYIVPHGDHYHY IPKNELSASELAAAKAFLSGRGNLSNSRTYRRQNSD - 250 

Query: 119 WEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQ-EHVKDNEKVNSNVAVARSQ 177 

N + ++ V NT+NQ D + + + 

Sbjct: 251 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLS 300 

Query: 178 GRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXX 232 

R+ +DG + F+PA I T N VPHG HYH+IP 
Sbjct: 301 QRHVESDGL I FDPAQ I TSRTANGVAVPHGDHYHF I PYS QLS PLEEKLARI I PLRYRSNHW 360 

Query: 233 - -NMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGL 290 

+ +P Q S ST + + A P+N + + L+KE E +G+ 

Sbjct: 361 VPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPID- -EKLVKEAVRKVGDGYVFEENGV 418 

Query: 291 V - FDPAKI IS- RTPNGVAI PHGDHYHF I P YSKLS ALEEKI ARRVP I SGTGSTVSTNAKPN 348 

+ PAK +S T G+ SKL A +E ++ ++ T S N 

Sbjct: 419 PRYI PAKDLSAETAAGID SKL-AKQESLSHKLGAKKTDLPSSDREFYN 465 

Query: 34 9 EWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQ 4 08 

+ + A D + KD+ + D + + 

Sbjct: 466 KAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLV DD I L AF LAP I RH 521 

Query: 4 09 IGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYF 4 68 

+ PN + + + ++ + EDGY FD I +++ ++ H H+H+ 

Sbjct: 522 PERLGKPNAQITYTDDEIQVAK-LAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWI 580 

Query: 469 FKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIA 524 

K L+E + AAQ + +E GL S+ QD GN AK + + +++ +K+ 

Sbjct: 581 KKD S L S E AERAAAQAYAKE KGLTPPSTDHQD - SGNTEAKGAEAI YNRVKAAKKVP 634 

Query: 525 GIMKQYGVKRES IWNKEKNAI I YPHGDHHH 555 

Y ++ V + ++I PH DH+H 

Sbjct: 635 LDRMPYNLQ YTVE VKNGS L I I PH YDH YH 662 

Score =68.2 bits (165), Expect = 8e-10 

Identities = 49/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 
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Query: 138 LKDAAHADNVRTKDEIN- RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 

LKD + +D V+ D + I R + K N + + + + VA+ G+YTT DGY+F 

Sbjct: 498 LKDVS -SDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIF 556 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

4-P DI D G+AY+ PH H H+I + p ST + 

Sbjct: 557 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 611 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

+ -f AKG+ + +y+ A + D + + 4- IP 

Sbjct: 612 SGNTEAKGA EAI YNRVKAAKKVPLDRMP YNLQ YTVE VKNGSL 1 1 P 656 

Query: 309 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 657 HYDHYHNIKF 666 

Score =63.2 bits (152), Expect = 3e-08 

Identities = 74/365 (20%), Positives = 139/365 (37%), Gaps = 85/365 (23%) 

Query: 23 9 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 2 98 

L + T +NN S G + K+ENL + S+ +G+ 4- +1 

Sbjct: 26 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 71 

Query: 299 SRTPNGVAIPHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXX 354 

T G HGDHYH+ +PY + + E++ + P 4 K + + + 
Sbjct: 72 KITDQGYVTSHGDHYHYYNGKVPYDAI - -ISEELLMKDP NYQLKDEDII 118 

Query: 355 XXXXXXXXXXXXXKELSSASDGYIFNPKDI VEETATAYI - - VRHGDHFHYIPKSNQIGQP 412 

S GY+ V+ Y+ H D+ 4- N+ Q 

Sbjct: 119 SEIKGGYVIK VDGKYYVYLKDAAHADNVRTKEEINRQKQE 158 

Query: 413 TLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMSHGNHNHYFFKK 471 

+ TP + S 4-4 4-DGY F+A+ II 4 +14 HG4H HY K 

Sbjct: 159 HSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKN 218 

Query: 472 DLTEEQIKAAQKHL EEVKTSHNGLDSLS SHEQDYP 506 

+L+ 4-+ AA4- L 4- 4- N 4- S4-S S+ + 

Sbjct: 219 ELSASELAAAKAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTN 278 

Query: 507 GNAKEMKDLDKKIEEKIAGIMKQYGVKRES I WNKE KNAI I YPHGDHHHAD P I D 560 

A + D-fD 4-4-4- 4-QV4 4-4- 4-++N4- PHGDH+H P 

Sbjct: 279 SQASQSNDIDSLLKQLYKLPLSQRHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPYS 338 

Query: 561 EHKPV 5 65 

4- P + 

Sbjct: 339 QLSPL 343 
Score =42.7 bits (99), Expect = 0.036 

Identities = 44/186 (23%), Positives = 63/186 (33%), Gaps = 36/186 (19%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI 4 AY 4- H H H+I K + Q L PS 

Sbjct: 549 TTEDGY I FDPRD I TSDEGDAYVTPHMTHSHW I KKDSLS EAERAAAQAYAKEKGLTPPSTD 608 

Query: 426 LPINPGISHEKHEE DGYGFDANRI I AEDESGF IMSHGNHNH 466 

+ G + K E D++ + I+H+HH 
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Sbjct: 609 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 667 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDS LS SHEQD YPGNAKEMKDLD 516 

Y K + E + A K HE'SNG+SHQ + + 

Sbjct: 668 WFDEGLYEAPKGYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 72 7 

Query: 517 KKIEEK 522 

K EEK 
Sbjct: 72 8 KPNEEK 733 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG + FDP I S + PH H H+I LS E A+ 
Sbjct: 53 8 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 597 

Query: 33 3 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + D +N + V 

Sbjct: 598 KEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAK KVPLDRMPYNLQYTV 646 

Query: 3 86 EETATAY I VRHGDHFH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 64 7 E VKNGSL I I PHYDHYHNI 664 



tr Q 9AHT9 Pneumococcal histidine triad A protein [phtA] [Streptococcus 816 AA 
pneumoniae] 

a lign 

Score = 435 bits (1119) , Expect = e-120 

Identities = 217/369 (58%), Positives = 270/369 (72%), Gaps = 21/369 (5%) 

Query : 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+ GSA + LS+C+Y L ++++ K+NNRVSY+DG Q++QK+ENLTPD+VS 
Sbj ct : 1 MKINKKYLV-GSAAALILSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVS 59 

Query: 61 QKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVN 12 0 

++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+ SEELLMKDPNY+LKD DIVN 
Sbjct : 60 KREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYKLKDEDIVN 119 

Query: 121 EVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNE KVNS NVAVARS 176 

E VKGG Y+ I KVDGKYYVYLKDAAHADNVRTK+E INRQKQEH + E + + VA+ARS 

Sbjct: 12 0 EVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARS 179 

Query: 177 QGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQP 236 

QGRYTT+DGY+FW +DI IEDTG+AYIVPHG HYHYIP 
Sbjct: 180 QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLS 239 

Query: 237 SQLSYSSTASDN NTQS VAKGSTSKPANKSENLQSLLKELYDSPS 2 80 

+ +Y SDN NT + +T+ A++S ++ SLLK+LY P 

Sbjct: 240 NSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPL 299 

Query: 281 AQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGST 34 0 

+QR+ ESDGLVFDPA+I SRT GVA+ PHGDH YHFI P YS 4- + S LEE + IAR +P+ + 
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Sbjct: 300 SQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNH 359 

Query: 341 VSTNAKPNE 34 9 

+ + + P -f 

Sbjct: 360 WVPDSRPEQ 368 



Score = 75.5 bits (184), Expect = 5e-12 

Identities = 102/504 (20%), Positives = 188/504 (37%), Gaps = 60/504 (11%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEEL LMKDPNYQLKDAD I 118 

+ 1 + T Y+ HGDHYHY +P + L + EL L Y+ +++D 

Sbjct: 196 I IEDTGDAYI VPHGDHYHY IPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSD- 250 

Query: 119 VNEVKGGYI I KVDGKYYVYLKXIAAHADNVRTKDE INRQKQ - EHVKDNEKVNSNVAVARS Q 177 

N + ++ V NT+NQ D+-+ + 

Sbjct: 251 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLS 300 

Query: 178 GRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPS 237 

R+ 4-DG VF+PA I T VPHG HYH+IP + p 

Sbjct: 301 QRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIA RIIPL 353 

Query: 238 QLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSE-SDGLVFDPAK 296 

+ + D+ + + T +P+ + +L + S +Q + +G VF+ K 

Sbjct: 354 RYRSNHWVPDSRPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEE-K 412 

Query: 297 1 1 SRTPNGVAI PHGDHYHF 1 PYS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WXX 353 

ISR Y F +P + LE K+ + + 4- +S T + N P + 

Sbjct: 413 GISR YVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFY 460 

Query: 354 XXXXXXXXXXXXXXKELSSASDGYIFNPK- -DIVEETATAYIVRHGDHFHYIPKSNQIGQ 411 

Sbjct: 461 DKAYNLLTEAHKALFXNKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAPITHPER 520 

Query: 412 PTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKK 471 

PN+ + + I ++ + DGY FD + II+++ ++ H K 

Sbjct: 521 LGKPNSQIEYTEDEVRIAQ-LADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKD 579 

Query: 4 72 DLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYG 531 

+ + + AAQ + +E D + + + G++ K E + + I + Y 

Sbjct: 580 SLSDKEKVAAQAYTKEKGILPPSPD--ADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYM 637 

Query: 53 2 VKRES IWNKEKNAI I YPHGDHHH 555 

V+ V + +1 PH DH+H 

Sbjct: 638 VEH TVE VKNGNL I I PHKDHYH 658 



Score =60.1 bits (144), Expect = 2e-07 

Identities = 73/372 (19%), Positives = 131/372 (34%), Gaps = 107/372 (28%) 

Query: 2 39 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 2 98 

L + T +NN S G + K+ENL + S+ +G+ + +1 

Sbjct: 26 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 71 

Query: 299 SRTPNGVAI PHGDHYHF 1 PYS KLSALEEKIARRVP I SGTGSTVSTNAKPNE WXXX 354 

T G HGDHYH+ + PY + + 
Sbjct: 72 KI TDQGYVTS HGDHYHY YNGKVPYDAI I S 100 
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Query: 355 XXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR HGDHFH 401 

+EL Y +DIV E Y+++ H D+ 

Sbjct: 101 EELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVR 147 

Query: 402 YIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMS 460 

+ N+ Q + TP + S ++ +DGY F+A+ II + +1 + 

Sbjct: 148 TKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVP 207 

Query: 461 HGNHNHYFFKKDLTEEQ I KAAQKHL EEVKTSHNGLDSLS 499 

HG+H HY K +L + ++ AA+ L + + N + S + S 

Sbjct: 208 HGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTT 2 67 

Query: 500 SHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIV VNKEKNAIIYP 549 

S + + A + D+D + + + + Q V+ + +V + + + P 

Sbjct: 268 NTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPAQITSRTARGVAVP 327 

Query: 55 0 HGDHHHADPIDE 561 

HGDH+H P + 
Sbjct: 328 HGDHYHFIPYSQ 339 



Score =57.0 bits (136), Expect = 2e-06 

Identities = 36/155 (23%), Positives = 57/155 (36%), Gaps = 20/155 (12%) 

Query: 165 E KVNSNVAVARS QGR YTTNDG YVFNP AD 1 1 EDTGNAY I VPHGGH YH Y I PXXXXXXXXXXX 224 

E V +A+ +YTT+DGY+F+ DII D G+AY + PH GH H+I 

Sbjct: 529 EYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKDSLSDKEKVA 588 

Query: 22 5 XXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRY 2 84 

+ P A+ + A +Y+ + + 

Sbjct: 58 9 AQAYTKEKGILPPSPDADVKANPTGDSAAA IYNRVKGEKR 62 8 

Query: 285 SESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYS 319 

L + + + IPH DHYH I 

Sbjct: 629 I PLVRL P YMVEHTVE VKNGNL I I PHKDHYHNI KFA 663 



tr Q8DQ08 Pneumococcal histidine triad protein D [phtD] [Streptococcus 85 3 AA 
pneumoniae (strain ATCC BAA-255 / R6)] 

al ign 



Score = 434 bits (1117) , Expect = e-120 

Identities = 218/357 (61%) , Positives = 272/357 (76%) , Gaps = 11/357 (3%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQV 5 9 

MK +KKY+A GS +++LS+C+Y L +H+ + Q K++NRVSY+DG Q+ QK+ ENLTPD+V 
Sbj ct : 1 MKINKKYLA-GSVAVLALSVCSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEV 5 9 

Query : 60 SQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLICDADIV .119 

S++EGI AEQ I VI KI TDQG Y VTSHGDH YHY YNGKVP YDA+ SEELLMKDPNYQLKD+DIV 
Sbjct : 60 SKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIV 119 

Query: 12 0 NE VKGGYI I KVT^GKYYVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVAVARS QG 178 

NE+KGGY+IKVDGKYYVYLKDAAHADN+RTK+EI RQKQE + N + VA AR+QG 
Sbjct: 120 NEIKGGYVIKVDGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQG 179 
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Query: 179 RYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQ 238 

RYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP Q S+ 

Sbjct: 180 RYTTDDGYI FNASDI IEDTGDAYIVPHGDHYHYIP - - KSDLSASELAAAQAYWNGKQGSR 23 7 

Query: 239 LSYSSTASDNNTQ SVAKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVF 292 

S SS+ + N Q S TP N+ EN+ SLL+ELY P ++R+ ESDGL+F 

Sbjct: 238 PSSSSSHNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIF 297 

Query: 2 93 DPAKI I SRTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGS TVS TNAKPNE 34 9 

DPA+I SRT NGVA+PHGDHYHFIPYS+LS LEEK+AR +P+ + + + + P + 

Sbjct: 298 DPAQITSRTANGVAVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQ 354 



Score = 84.3 bits (207), Expect = le-14 

Identities = 90/385 (23%), Positives = 134/385 (34%), Gaps = 73/385 (18%) 

Query: 193 IIEDTGNAYIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNM 234 

+1+ T Y+ HG HYHY P + 

Sbjct : 71 VI KITDQGYVTSHGDHYHY YNGKVP YDAI I S EELLMKDPNYQLKDSD I VNE I KGGYVI KV 130 

Query: 235 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 2 93 

Y A+ + + + +S N S + AQ RY+ DG +F + 

Sbjct: 131 DGKYYVYLKDAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYI FN 190 

Query: 294 PAKI I SRTPNGVAI PHGDHYHF I PYS KLS ALEEKI ARRVP I SGTGS TVST NAKP-- 347 

+11 T 4- +PHGDHYH+IP S LSA E A+ GS S+ NA P 

Sbjct: 191 ASDIIEDTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQ 250 

Query: 348 NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNPKDIVEETATAY 392 

N V +EL + SDG IF+P I TA 

Sbjct: 251 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGV 310 

Query: 3 93 I VRHGDHFH Y I P KS -NQIGQPTLPNNSLATPSPSLP 427 

V HGDH+H+IP S + + QP+ + + PSP 

Sbjct: 311 AVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPA 370 

Query: 428 INPGI SHEKHEEDGYGFDANRI I AEDESGF IMSHGNHNHYFFKKDLTEEQI KAAQKHL - E 486 

NP + + + +A R + + G++ Y KDL+ E L + 

Sbjct: 371 PNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGVPRYIPAKDLSAETAAGIDSKLAK 427 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ SH L++DP++E 
Sbjct: 428 QESLSH KLGAKKTDLPS SDRE 448 



Score = 68.2 bits (165), Expect = 8e-10 

Identities = 49/190 (25%) , Positives = 80/190 (41%) , Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 

LKD + +D V+ D+I R + K N + 4- +4- VA+ G+YTT DGY+F 

Sbjct: 484 LKDVS - SDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYI F 542 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 543 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 597 

Query: 24 9 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 3 08 

+ + AKG+ + +Y+ A + D + ++ + + IP 
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Sbjct: 598 SGNTEAKGA EAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IP 642 

Query: 309 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 64 3 HYDHYHNIKF 652 



Score = 58.2 bits (139), Expect = 8e-07 

Identities = 60/306 (19%), Positives = 112/306 (35%), Gaps = 84/306 (27%) 

Query: 297 IISRTPNGVAIPHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGS TVSTNAKPNE WX 352 

+ 1 T G HGDHYH+ 4-PY + + 
Sbjct: 71 V I K I TDQG YVT S HGDH YH Y YNGKVP YDA IIS 101 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 406 

4-EL Y DIV E Y+++ G ++ Y+ + 

Sbjct: 102 EELLMKDPNYQLKDSD I VNE I KGG YVI KVDGKY YVYLKDAAHADN 146 

Query: 407 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 45 8 

Q + + 4-NS A + + G +DGY F+A+ II 4- +1 

Sbjct: 147 IRTKEEIKRQKQERSHNHNSRADNAVAAARAQG RYTTDDG YI FNASD I I EDTGDAYI 203 

Query: 459 MSHGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSHNGLDSLSSHEQDY 505 

+ HG+H HY K DL+ + 4- AAQ + 4-SHN + + 4-4- 

Sbjct: 204 VPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTP 263 

Query: 506 PGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKE KNAI I YPHGDHHHAD P I 559 

+ 4- 4-4-4- +EA++V+ + + +++N + PHGDH+H P 

Sbjct: 264 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPY 323 

Query: 560 DEHKPV 565 
+ P+ 

Sbjct: 324 SQLSPL 329 



Score =47.4 bits (111), Expect = 0.001 

Identities = 61/260 (23%), Positives = 90/260 (34%), Gaps = 51/260 (19%) 

Query: 372 SASDGYI FNPKDI VEETATAYIVRHGDHFHYI PKSN QIGQPTLPNNSLATPSPS 425 

+ DGY1F+P+DI + AY+ H H H+I K + Q L PS 

Sbjct: 535 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 594 

Query: 426 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+ G + K E D 4-4- -f 1+ H +H H 

Sbjct: 595 HQ - DSGNTEAKGAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNI KFE 653 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLS SHEQD YPGNAKEMKDLD 516 

Y K+E+AK HE S NG + S H Q + + 

Sbjct: 654 WFDEGLYEAPKGYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 713 

Query: 517 KKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYEL 576 

K EEK + E 4-KE + + P H 4-D 4- H VG+ S N h 

Sbjct: 714 KPNEEK PQTEKPEEDKEHDEVSEP- -THPESDEKENH- -VGLNPSADN- -L 758 

Query: 577 FKPEEGVAKKEGNKVYTGEE 596 

+KP + E T +E 

Sbjct: 759 YKPSTDTEETEEEAEDTTDE 778 
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Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP I S + PH H H+I LS E A+ 
Sbjct: 524 E I QVAKLAGKYTTEDGYI FDPRD ITSDEGDAYVTPHMTHSHW I KKDSLSEAERAAAQAYA 583 

Query: 33 3 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + 4- +T AK E + D +N + V 

Sbjct: 584 KEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAK KVPLDRMPYNLQYTV 632 

Query: 3 86 E E TATAY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 633 E VKNGS L 1 1 PHYDHYHN I 650 



tr Q6T8D7 Pneumococcal protein BVH-11-3 [Streptococcus 817 AA 

pneumoniae] al ign 

Score = 431 bits (1109), Expect = e-119 

Identities = 217/355 (61%), Positives - 266/355 (74%), Gaps = 22/355 (6%) 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVS 60 

MK +KKY+ GSA + LS+C+Y L ++++ K+NNRVSY+DG Q++QK+ENLTPD+VS 



++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+FSEELLMKDPNY+LKD DIVN 



EVKGGY+IKVDGKYYVYLKDAAHADNVRTK+EINRQKQEH + E + + VA+ARS 



QGRYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP 



SQLSYSSTASDN NTQSVAKGSTSKPANKS -ENLQSLLKELYDSP 2 79 

+ +Y SDN NT + +T+ A++S E + 4- SLLK+LY P 



++R+ ESDGLVFDPA+I SRT GVA+PHGDHYHFIPYS++S LEE+IAR +P+ 



Score = 79.7 bits (195), Expect = 3e-13 

Identities = 102/503 (20%), Positives = 189/503 (37%), Gaps = 57/503 (11%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEEL- -LMKDPNYQLKDADI 118 

+1+ T Y+ HGDHYHY +P + L + EL L Y+ +++D 

Sbjct: 196 I IEDTGDAYIVPHGDHYHY 1 PKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSD - 250 

Query: 119 VNEVKGGY 1 1 KVDGKYYVYLKDAAHADNWTKDE INRQKQEHVKDNEKWSNVAVARSQG 178 



Query: 


1 


Sbjct: 


1 


Query : 


61 


Sbjct: 


60 


Query : 


121 


Sbjct: 


120 


Query : 


177 


Sbjct: 


180 


Query : 


237 


Sbjct : 


240 


Query : 


280 


Sbjct : 


300 
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N + ++ V N N Q + +D + + + 

Sbjct: 251 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNGQASQSNEDVDSLLKQLYALPLSK 302 

Query: 179 RYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQ 23 8 

R+ +DG VF+PA I T VPHG HYH+IP + P + 

Sbjct: 303 RHVESDGLVFDPAQ I TSRTARGVAVPHGDH YHF I PYS QMSELEER I A RIIPLR 355 

Query: 239 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSE-SDGLVFDPAKI 297 

+ D-f + + T +P+ + +L + S +Q + +G VF + K 
Sbjct: 356 YRSNHWPDSRLEQPSLQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEE-KG 414 

Query: 2 98 ISRTPNGVAIPHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXX 3 54 

ISR Y F +P + LE K++++ +S T + N P + 

Sbjct: 415 ISR YVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFYD 462 

Query: 355 XXXXXXXXXXXXX KE L S S AS DG Y I FNP K - - D I VE E TAT AY I VRHGDHFH Y I P KS NQ I GQ P 412 

E + + K+ + + +T D++ + 

Sbjct: 463 KAYNLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAPITHPERL 522 

Query: 413 TLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKD 472 

PN-f + + I ++ + DGY FD + II + + + H H+H+ K 

Sbjct: 523 GKPNSQIEYTEDEVRI AQ - LADKYTTSDGYI FDEHDI I SDEGDAYVTLHMGHSHWIGKDS 581 

Query: 4 73 LTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGV 532 

+ + + AAQ + +E D + + + G + + K E + + I + Y V 

Sbjct: 582 LSDKEKVAAQAYTKEKGILPPSPD- -ADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYMV 639 

Query: 533 KRESIWNKEKNAIIYPHGDHHH 555 

+ V . + +1 PH DH+H 

Sbjct: 64 0 EH TVE VKNGNL I I PHKDHYH 65 9 



Score = 59.3 bits (142), Expect = 4e-07 

Identities = 76/340 (22%) , Positives = 111/340 (32%) , Gaps = 111/340 (32%) 

Query: 23 9 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 2 98 

L + T +NN S G + K+ENL + S+ +G+ + +1 

Sbjct: 26 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI - VI 71 

Query: 299 SRTPNGVAIPHGDHYHF IPYSKLSALE EKIARRVP ISG 336 

T G HGDHYH+ +PY + ~h E E I V + G 

Sbjct : 72 KITDQGYVTSHGDHYHYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVTSTEVKGGYVIKVDG 131 

Query: 33 7 TGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSS ASDGYIF 379 

+A + V +E + DGYIF 

Sbjct: 132 KYYVYLKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIF 191 

Query: 38 0 NP KD I VE E T ATA Y I VRHGDH FH Y I P K SNQIGQPTLPN 416 

N DI+E+T AYIV HGDH+HYIPK S + N 

Sbjct: 192 NASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDN 251 

Query: 417 NSLATPSPSLPINPG ISHEKHEEDGY 442 

S PS + NPG +S E DG 

Sbjct: 252 TSRTNWVPSVS-NPGTTNTNTSNNSNTNGQASQSNEDVDSLLKQLYALPLSKRHVESDGL 310 

Query: 44 3 GFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQ 4 82 

FD +1 + G + HG+H H+ ++E + + A+ 

Sbjct: 311 VFDPAQ I TSRTARGVAVPHGDH YHF I PYS QMSELEER I AR 350 
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Score = 53.1 bits (126), Expect = 3e-05 

Identities = 35/155 (22%), Positives = 56/155 (35%), Gaps = 20/155 (12%) 

Query: 165 EKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXX 224 

E V +A+ +YTT+DGY+F+ DII D G+AY + H GH H+I 

Sbjct: 530 EYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTLHMGHSHWIGKDSLSDKEKVA 589 

Query: 22 5 XXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRY 2 84 

+ P A+ + A +Y+ + + 

Sbjct: 590 AQAYTKEKGILPPSPDADVKANPTGDSAAA IYNRVKGEKR 629 

Query: 285 SESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I P YS 319 

L + + + IPH DHYH I + + 

Sbjct: 630 I PLVRL P YMVEHTVE VKNGNL I I PHKDH YHN I KFA 664 



Score =43.5 bits (101), Expect = 0.021 

Identities = 52/271 (19%), Positives = 94/271 (34%), Gaps = 93/271 (34%) 

Query: 384 I VE E TAT AY I VRHGDH FH Y IPKSNQIGQPTL PNNSLATPSPSLPINPGI 432 

+ + + T Y+ HGDH+HY +P ~f~ L PN h + G 

Sbjct: 70 VIKITDQGYVTSHGDHYHYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVNEVKGGYVIKV 129 

Query: 433 SHEKHEEDG Y 442 

H +H E G Y 
Sbjct: 13 0 DGKYYVYLKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGY 18 9 

Query: 443 GFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL E 486 

F+A+ +1+ HG+H HY K +L+ + + AA+ L 

Sbjct: 190 IFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNS 249 

Query: 487 EVKTSHNGLDSLS SHEQDYPGNAKEM-KDLDKKIEEKIAGIMKQYGVKRES 536 

+ + N + S+S S+ + G A + +D+D + + + A + + V+ + 

Sbjct: 250 DNTSRTNWVPSVSNPGTTNTNTSNNSNTNGQASQSNEDVDSLLKQLYALPLSKRHVESDG 309 

Query: 537 IV VNKE KNA 1 1 Y PHGDHHHAD P I DE 561 

+V ++ + PHGDH+H P + 

Sbjct: 310 LVFDPAQITSRTARGVAVPHGDHYHFIPYSQ 340 



tr Q97QM8 Conserved domain protein [SP1175] [Streptococcus pneumoniae] 802 AA 

a lign 

Score = 424 bits (1089), Expect = e-117 

Identities = 209/352 (59%), Positives = 259/352 (73%), Gaps = 20/352 (5%) 

Query : 18 LSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQ 77 

LS+C+Y L ++++ K+NNRVSY+DG Q++QK+ENLTPD+VS++EGI AEQIVIKITDQ 
Sbjct : 3 LSVCSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQ 62 

Query: 78 GYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVY 137 

GYVTSHGDHYHYYNGKVPYDA+ SEELLMKDPNY+LKD D I VNE VKGGY+ I KVDGKY YVY 
Sbjct: 63 GYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYKLKDEDIWEVKGGYVIKVDGKYYVY 122 
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Query: 138 LKDAAHADNVRTKDEINRQKQEHVKDNE KVNSNVAVARSQGRYTTNDGYVFNPAD I 193 

LKDAAHADNVRTK+E INRQKQ.EH + E + + VA+ARSQGRYTT+DGY+FN +DI 

Sbjct: 123 LKDAAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDI 182 

Query: 194 IEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 24 8 

IEDTG+AYIVPHG HYHYIP + +y S DN 

Sbjct: 183 IEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTN 242 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI 297 

NT + +T+ A++S ++ SLLK+LY P + QR+ ESDGLVFDPA+I 

Sbjct: 243 WVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPAQI 302 

Query: 298 I SRTPNGVAI PHGDH YHF I PYS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE 349 

SRT GVA+ PHGDHYHF I PYS + + S LEE+IAR +P+ + ++ + P + 
Sbjct: 303 TSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQ 354 



Score = 79.0 bits (193), Expect = 5e-13 

Identities = 65/282 (23%), Positives = 108/282 (38%), Gaps = 57/282 (20%) 

Query: 280 SAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGS 339 

S RY+ DG 4-F+ + 11 T + +PHGDHYH+IP ++LSA E A +SG G+ 
Sbjct: 165 SQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSA-SELAAAEAFLSGRGN 223 

Query: 340 TVSTN AKPNEWXXXXXXXXXXXXXXXXKELSSA 373 

T TN + P ++ g 

Sbjct: 224 LSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKL 283 

Query: 3 74 SDGYI FNPKDI VEETATAYI VRHGDHFHYI PKSNQI G 410 

SDG 4-F+P I TA V HGDH+H+IP S 
Sbjct: 284 PLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRS 343 

Query: 411 QPTLPNNSLATPSPSLPINPGISHEKHEE DGYGFDANRIIAEDESGFIMSHGNHNHY 467 

+ P++ PSP P + D + + + + + G++ + Y 

Sbjct: 344 NHWVPDSRPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGISRY 403 

Query: 468 FFKKDLTEEQIKAAQKHLEEVKTSHNGL DSLSSHEQDY 5 05 

F KDh E +K + L + +•+ + L ++ 4- + 
Sbjct: 404 VFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEF 445 



Score =77.8 bits (190), Expect = le-12 

Identities = 103/504 (20%), Positives = 189/504 (37%), Gaps = 60/504 (11%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEEL LMKDPNYQLKDADI 118 

+1+ T Y+ HGDHYHY +P + L + EL L Y+ +++D 

Sbjct: 182 I IEDTGDAYIVPHGDHYHY 1 P KNE L S AS E L AAAE AFL S GRGNL S N S RT YRRQNS D - 236 

Query: 119 VNE VKGGYI I KVDGKY YVYLKDAAHADNVRTKDE INRQKQ - EHVKDNEKVNSNVAVARS Q 177 

N + ++V NT+NQ D + + + 

Sbjct: 237 -NTSRTNWVPSVSNP GTTNTNTSNNSNTNSQAS QSND IDS LLKQL YKLPLS 286 

Query: 178 GRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPS 237 

R+ +DG VF+PA I T VPHG HYH+IP + p 

Sbjct: 287 QRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIA RIIPL 339 

Query: 23 8 QLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSE-SDGLVFDPAK 2 96 
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4- 4- + + T +P+ 4- 4-L 4- S 4-Q + 4-G VF4- K 

Sbjct: 340 RYRSNHWVPDSRPEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEE-K 398 

Query: 297 1 1 SRTPNGVAI PHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WXX 353 

ISR Y F 4~P + LE K+ + 4-4* +s T + N P + 

Sbjct: 399 GISR YVFAKDLPSETVKNLESKLSKQESVSHTLTAKKENVAPRDQEFY 446 

Query: 354 XXXXXXXXXXXXXXKELS S ASDG YI FNPK - - D I VEETATAY I VRHGDHFHYI PKSNQ I GQ 411 

E + + K+ + + +T D +4- + 

Sbjct: 447 DKAYNLLTEAHKALFENKGRNSDFQALDKLLERLNDESTNKEKLVDDLLAFLAPITHPER 506 

Query: 412 PTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKK 471 

PN+ 4- + I 4-4-4- DGY FD + II+++ ++ H H+H+ K 

Sbjct: 507 LGKPNSQIEYTEDEVRIAQ-LADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKD 565 

Query: 472 DLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYG 531 

L++++ AAQ + +E D + + + G++ K E+ + I 4- Y 

Sbjct: 566 SLSDKEKVAAQAYTKEKGILPPSPD- -ADVKANPTGDSAAAI YNRVKGEKRIPLVRLPYM 623 

Query: 532 VKRES I WNKEKNAI I YPHGDHHH 55 5 

V+ V 4- +1 PH DH+H 

Sbjct: 624 VEH TVEVKNGNL I I PHKDHYH 644 

Score =60.1 bits (144), Expect = 2e-07 

Identities = 73/372 (19%), Positives = 131/372 (34%), Gaps = 107/372 (28%) 

Query: 23 9 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 2 98 

L 4- T +HN S G + K+ENL + S + +G4- + +1 

Sbjct: 12 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 57 

Query: 299 SRTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXX 354 

T G HGDHYH+ 4-PY 4- + 
Sbjct: 58 KITDQGYVTSHGDHYHYYNGKVPYDAI IS 86 

Query: 355 XXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR HGDHFH 401 

+EL Y +DIV E Y+4- + H D+ 

Sbjct: 87 EELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVR 133 

Query: 402 YIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMS 460 

+ N+ Q + TP + S 4-4- +DGY F+A+ II 4- +1 + 

Sbjct: 134 TKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVP 193 

Query: 461 HGNHNHYFFKKDLTEEQIKAAQKHL EEVKTSHNGLDSLS 499 

HG+H HY K 4-L + + 4- AA+ L + + N 4- S + S 

Sbjct: 194 HGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTT 253 

Query: 500 SHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIV VNKEKNAI I YP 549 

S+ + A 4- D+D 4-4-4- 4-Q V+ 4- 4-V + P 

Sbjct: 254 NTNTSNNSNTNSQASQSNDIDSLLKQLYKLPLSQRHVESDGLVFDPAQITSRTARGVAVP 313 

Query: 55 0 HGDHHHADPIDE 561 

HGDH+H P + 
Sbjct: 314 HGDHYHFIPYSQ 32 5 

Score =57.0 bits (136), Expect = 2e-06 

Identities = 36/155 (23%), Positives = 57/155 (36%), Gaps = 20/155 (12%) 
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Query: 165 EKATNSNVAVARSQGRYTTNDGYVFNPADI IEDTGNAYIVPHGGHYHYI PXXXXXXXXXXX 224 

E V +A+ +YTT+DGY+F+ DII D G+AY+ PH GH H+I 

Sbjct: 515 EYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTPHMGHSHWIGKDSLSDKEKVA 574 

Query: 225 XXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRY 284 

+ P A+ + A +Y+ + + 

Sbjct: 575 AQ AYTKEKG I L PPS PDAD VKANPTGD S AAA IYNRVKGEKR 614 

Query: 285 S E SDGL VFDP AKI I S RTPNGVAI PHGDHYHF I PYS 319 

L + + + IPH DHYH I ++ 

Sbjct: 615 I P L VRL P YMVEHTVE VKNGNL I I PHKDH YHN I KF A 649 



tr 09 ANY;? Pneumococcal histidine triad protein D precursor (Fragment) 83 9 AA 
(Hypothetical protein SP1003) (Fragment) [phtD] 

[Streptococcus pneumoniae] align 
Score = 423 bits (1088), Expect = e-117 

Identities = 215/359 (59%), Positives = 268/359 (73%), Gaps = 14/359 (3%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRS-QENKDNNRVSYVDGSQSSQKSENLTPDQV 59 

MK +KKY+A GS +++LS+C+Y L +H++ Q K++NRVSY+DG Q+ QK+ENLTPD+V 
Sbjct: 1 MKINKKYLA-GSVAVLALSVCSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEV 59 

Query: 60 SQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIV 119 

S++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIV 
Sbjct: 60 SKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIV 119 

Query: 12 0 NEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSN VAVARS 176 

NE+KGGY+ IKVDGKYYVYLKDAAHADN+RTK+EI RQKQEH N SN V AR+ 
Sbjct: 12 0 NEIKGGYVIKVDGKYYVYLKDAAHADNIRTKEEIKRQKQEH-SHNHGGGSNDQAWAARA 178 

Query: 177 QGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQP 236 

QGRYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP q 
Sbjct: 179 QGRYTTDDGYI FNASDI IEDTGDAYI VPHGDHYHYI P - - KNELS ASELAAAEAYWNGKQG 236 

Query: 237 SQLSYSSTASDNNTQ SVAKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGL 290 

S+ S SS+ + N Q S TP N+ EN+ SLL+ELY P ++R+ ESDGL 

Sbjct: 237 SRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGL 296 

Query: 291 VFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 349 

+FDPA+I SRT GVA+PHG+HYHFIPY ++S LE++IAR +P+ + +++P + 
Sbjct: 297 IFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQ 355 

Score = 77.8 bits (190), Expect = le-12 

Identities = 106/501 (21%), Positives = 185/501 (36%), Gaps = 62/501 (12%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKV 130 

+1+ T Y+ HGDHYHY +P+L+EL ++K + 
Sbjct: 195 I I EDTGD A Y I VPHGDHYH Y IPKNELSASELAAAEAYWNGKQGSRPSSSSS 244 

Query: 131 DGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQG- --RYTTNDGYV 187 

Y + A + + H E ++S + ++ R+ +DG + 

Sbjct: 245 YNANPAQP-RLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLI 2 97 
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Query: 188 FNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXX NMQPSQLS 240 

F4-PA I T VPHG HYH4-IP + +P Q S 

Sbjct: 298 FDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPS 357 

Query: 241 YS STASDNNTQS VAKGSTS KPANKSENLQSLLKEL YDS PS AQRYS ESDGLV - FDPAKI I S 299 

ST + + A P+N + + L+KE E +G+ + PAK +S 

Sbjct: 358 PQSTPEPSPSPQPAPNPQPAPSNPID- -EKLVKEAVRKVGDGYVFEENGVSRYIPAKDLS 415 

Query: 300 -RTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXX 358 

T G+ SKL A +E 4-+ +4- T S N-f 

Sbjct: 416 AETAAGID SKL-AKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIH 462 

Query: 359 XXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNS 418 

+ A D 4- KD-f + D ++ + PN 

Sbjct: 463 QDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLV DD I LAFLAP IRHPERLGKPNAQ 518 

Query: 419 LATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQI 478 

+ + + 4-4- + EDGY FD I + 4-4- ++ H H+H+ K L+E + 

Sbjct: 519 ITYTDDEIQVAK-LAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAER 577 

Query: 47 9 KAAQKHLEEVKTSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIAGIMKQYGVKR 534 

AAQ + + E GL S + QD GN AK + + 4-4-4- +K+ Y 4-4- 

Sbjct: 578 AAAQAYAKE KGLTPPSTDHQD - SGNTEAKGAEAI YNRVKAAKKVPLDRMPYNLQ - 630 

Query: 535 ES I WNKEKNAI I YPHGDHHH 555 

V + ++I PH DH+H 
Sbjct: 631 - - YTVEVKNGS L 1 1 PHYDHYH 64 9 



Score = 77.0 bits (188), Expect = 2e-12 

Identities = 72/276 (26%), Positives = 109/276 (39%), Gaps = 54/276 (19%) 

Query: 2 83 RYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVS 342 

RY+ DG +F+ + II T 4- 4- PHGDH YH4- 1 P + 4-LSA E A GS S 

Sbjct: 181 RYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPS 240 

Query: 343 T NAKP NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNP 381 

+ NA P N V +EL 4- SDG IF + P 

Sbjct: 241 SSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDP 300 

Query: 3 82 KDIVEETATAYIVRHGDHFHYIP KSN QIGQPTLPN 416 

I TA V HG+H+H+IP +SN + QP4- + 

Sbjct: 301 AQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQS 360 

Query: 417 NSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEE 476 

4-PSP NP 4- 4-4- 4-A R 4- 4- G+ + 4- Y KDL+ E 

Sbjct: 361 TPEPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGVSRYIPAKDLSAE 417 

Query: 477 Q I KAAQKHL - EE VKTSHNGLDSLS SHEQD YPGNAKE 511 

L 4-+ SH L + + D P + +E 

Sbjct: 418 T AAG IDS KL AKQE S L S H KLGAKKTDLPSSDRE 449 

Score = 67.4 bits (163), Expect = le-09 

Identities = 49/190 (25%), Positives = 79/190 (40%), Gaps = 30/190 (15%) 
Query: 138 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 
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LKD +D V+ D+I R + K N +4 +4- VA+ G+YTT DGY+F 

Sbjct: 485 LKDVP-SDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIF 543 

Query: 18 9 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 24 8 

+P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 544 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 598 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

4 4 AKG+ + +Y+ A + D + + 4 + + IP 

Sbjct: 599 SGNTEAKGA EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IP 643 

Query: 309 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 644 HYDHYHNIKF 653 



Score = 48.5 bits (114), Expect = 7e-04 

Identities = 51/288 (17%) , Positives = 104/288 (35%) , Gaps = 55/288 (19%) 

Query: 297 1 1 SRTPNGVAI PHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGS TVSTNAKPNE WX 352 

+ 1 T G HGDHYH+ + PY + + E + 4 + P + K 444V 

Sbjct: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDAI - - ISEELLMKDP NYQLKDSDIVN 120 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQP 412 

L A+ K+ +4- + HG 44Q 
Sbjct: 121 E I KGGYVI KVDGKYYVYLKDAAHADNI RTKEE I KRQKQEHSHNHGGG SNDQAWA 175 

Query: 413 TLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKD 472 

T +DGY F+A+ II + +1+ HG+H HY K 4 

Sbjct: 176 ARAQGRYTT DDGYIFNASDI IEDTGDAYIVPHGDHYHYIPKNE 218 

Query: - 473 LTEEQ I KAAQKHLEE VKTSHNGLDSLS SHEQD YPG NAKEMKDLDKKI 519 

L+ +4- AA+ + + S S + P + + + 

Sbjct: 219 LSASELAAAEAYWNGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLL 278 

Query: 52 0 E E KI AG I MKQ YGVKRE S I WNKEKNAI I YPHGDHHHADPIDE 561 

EA++V4 + 4- +++ + PHG-fH+H P 44 

Sbjct: 279 RELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQ 326 



Score =42.0 bits (97), Expect = 0.061 

Identities = 44/185 (23%) , Positives = 64/185 (33%) , Gaps = 36/185 (19%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

4 DGYIF+P+DI 4 AY 4 H H H+I K + Q L PS 

Sbjct: 536 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 595 

Query: 426 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+ G + K E D ++ + 1+ H +H H 

Sbjct: 596 HQ - D S GNTE AKGAE A I YNRVKAAKKVPLDRMP YNLQ YTVE VKNGS L I I PH YDH YHN I KFE 654 

Query: 467 YFFKKDLTEEQIKAAQK HL EE VKTSHNGLDSLS SHE QDYPGNAKEMKDLD 516 

Y KTE + AK HE S NG 4 S H 4 4 DD 

Sbjct: 655 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNKVDQDSKPDED 714 

Query: 517 KKIEE 521 

K4 4E 

Sbjct: 715 KEHDE 719 
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Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP I S + PH H H+I LS E A+ 

Sbjct: 525 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 584 

Query: 33 3 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 385 

P + + +T AK E + D +N + V 

Sbjct: 585 KE KGL T P P S TDHQD S GNTE AKGAE A I YNRVKAAK KVPLDRMPYNLQYTV 633 

Query: 3 86 E E TAT AY I VRHGDH FH Y I 4 03 

E 4- 1+ H DH+H I 

Sbjct: 634 EVKNGS L 1 1 PHYDHYHNI 651 



tr .Q57QM9 Conserved domain protein [SP1174] [Streptococcus pneumoniae] 819 AA 

align 

Score = 423 bits (1087) , Expect = e-116 

Identities = 211/357 (59%), Positives = 271/357 (75%), Gaps = 11/357 (3%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQV 5 9 

MK +KKY+A GS +++LS+C+Y L ++++ Q + K++NRV+Y+DG Q+ QK+ENLTPD+V 
Sbjct: 1 MKINKKYLA-GSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQKAENLTPDEV 59 

Query: 60 SQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIV 119 

S++EGI AEQIVIKITDQGYVTSHGDHYHYYNGK.VPYDA+ SEELLMKDPNYQLKD+DIV 
Sbjct: 60 SKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIV 119 

Query: 12 0 NEVKGGYIIKVDGKYYVYLKDAAHAD 178 

NE+KGGY+IKV+GKYYVYLKDAAHADN+RTK+EI RQKQE + N + ++ VA AR+QG 
Sbjct: 12 0 NE I KGG YVI KVNGKYYVYLKDAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQG 179 

Query: 179 RYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQ 238 

RYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP Q s+ 

Sbjct: 180 RYTTDDGYIFNASDIIEDTGDAYI VPHGDHYHYIP- -KNELSASELAAAEAYWNGKQGSR 237 

Query: 239 LSYSSTASDNNTQ SVAKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVF 292 

S SS + + N Q S TP N+ EN-*- SLL+ELY P ++R+ ESDGL + F 

Sbjct: 238 PSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIF 297 

Query: 2 93 DPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 34 9 

DPA+I SRT GVA+PHG+HYHFIPY ++S LE++IAR +P+ + + 4-+P E 

Sbjct: 298 DPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEE 354 



Score =82.0 bits (201), Expect = 5e-14 

Identities = 93/391 (23%), Positives = 135/391 (33%), Gaps = 91/391 (23%) 

Query: 193 1 1 EDTGNAY I VPHGGHYH Y I PXXXXXXXXXXXXXXXXXXXNM 234 

+1+ T Y+ HG HYHY p + 

Sbjct: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKV 130 
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Query: 23 5 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 2 93 

Y A+ + + + + S N S 4 AQ RY+ DG + F+ 

Sbjct: 131 NGKYYVYLKDAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYI FN 190 

Query: 294 PAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVST NAKP-- 34 7 

+ 11 T + +PHGDHYH+IP ++LSA E A GS S+ NA P 

Sbjct: 191 ASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQ 250 

Query: 348 NEWXXXXXXXXXXXXXXXXKELSSA- - S DG Y I FN P KD I VE E TAT AY 392 

N V +EL + SDG IF+P I TA 

Sbjct: 251 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGV 310 

Query: 3 93 IVRHGDHFHYIP KSNQ IGQPTLPNNSLATPS 42 3 

V HG+H4H+IP +SN QPT + P + 

Sbjct: 311 AVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEEPSPQPTPEPSPSPQPA 370 

Query: 424 PSLPINPGISHE - - KHEEDGYGFDANRI IAEDESGFIMSHGNHNHYFFKKDLTEEQIKAA 481 

PS P1 + + E + DGY F+ N + + Y KDL+ E 

Sbjct: 371 PSNPIDEKLVKEAVRKVGDGYVFEENGV SRYIPAKDLSAETAAGI 415 

Query: 482 QKHL-EEVKTSHNGLDSLSSHEQDYPGNAKE 511 

L + + SH L 4- + D P + +E 

Sbjct: 416 DSKLAKQESLSH KLGTKKTDLPSSDRE 442 



Score = 76.3 bits (186), Expect = 3e-12 

Identities = 116/555 (20%), Positives = 213/555 (37%), Gaps = 95/555 (17%) 

Query : 2 9 RSQENKDNNRVS YVDGS QS SQKSENLTPDQVS QKEG - 1 QAEQI VI KI TDQGYVTSHGDHY 8 7 

R + 4 + +N S D 4- +4- 4-44 + + 4-G I 4-1+ T Y+ HGDHY 

Sbjct: 155 RQKQERSHNHNSRADNAVAAARAQG RYTTDDGYIFNASDI IEDTGDAYIVPHGDHY 210 

Query: 8 8 HYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKDAAHADNV 14 7 

HY +P + L + EL + + K + Y + A 4 

Sbjct: 211 HY 1 PKNELSASELAAAEAYWNGKQGSRPSS S SS YNANPAQP - RL 253 

Query: 148 RTKDEINRQKQEHVKDNEKVNSNVAVARSQG RYTTNDGYVFNPADI IEDTGNAYI VP 204 

+ HE ++S + 4+ R4 4DG +F+PA IT VP 

Sbjct: 254 SENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVP 313 

Query: 205 HGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTS -KPAN 263 

HG HYH+IP ++ S ++Q +SS +PA 

Sbjct: 314 HGNHYHFIPYEQMSELEKRIARIIPL- -RYRSNHWVPDSRPEEPSPQPTPEPSPSPQPAP 371 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSA 323 

+ 4 L+KE DG VF+ NGV+ +IP LSA 

Sbjct: 372 SNPIDEKLVKEAVRKVG DGYVFEE NGVS RYIPAKDLSA 409 

Query: 324 LEEKIARRVPISGTGSTVSTNAKP NEWXXXXXXXXXXXXXXXXKELSS 372 

+4 K+A++ 4S T T+ N+ + 

Sbjct: 410 ETAAGIDSKLAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFE 469 

Query: 373 ASDGY I FNPKD I VEETATAYI - - VRHGDHFHY I PKSNQ IGQPTLPNNSLATPS P 424 

AD + KD+ + E A+4 +RH + ++G+P N + 
Sbjct: 470 ALDNLLERLKDVS SDKVKLVED I LAFLAP I RHPE RLGKP NAQITYTDD 517 

Query: 425 SLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKH 484 
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+ + + 4- + EDGY FD I +++ ++ H H+H+ K L+E 4- AAQ 4- 
Sbjct: 518 E I QVAK - L AGKYTTEDG Y I FDPRD I TS DEGD AYVTPHMTHSHW I KKD S L S EAERAAAQAY 576 

Query: 485 LEEVKTSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIAGIMKQYGVKRES IWN 540 

+E GL S + QD GN AK + + +4-+ +K> Y + 4- V 
Sbjct: 577 AKE KGLTPPSTDHQD - SGNTEAKGAEA I YNRVKAAKKVPLDRMPYNLQ YTVE 627 

Query: 541 KEKNAI I YPHGDHHH 555 

4- 4-4-1 PH DH+H 
Sbjct: 62 8 VKNGS L 1 1 PHYDHYH 642 

Score = 66.6 bits (161), Expect = 2e-09 

Identities = 48/190 (25%), Positives =80/190 (41%), Gaps = 30/190 (15%) 

Query: 138 LKDAAHADNVRTKDEIN RQKQEHVKDNEKV NSNVAVARS QGRYTTNDG YVF 188 

LKD + +D V+ ++I R + k N ++ 4- + VA+ G+YTT DGY+F 

Sbjct: 478 LKDVS - SDKVKLVEDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIF 536 

Query: 18 9 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 24 8 

+ P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 537 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 591 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

+ + AKG+ 4- +Y+ A + D + ++ 4- -f IP 

Sbjct: 592 SGNTEAKGA E AI YNRVKAAKKVPLDRMPYNLQ YTVE VKNGSLI IP 636 

Query: 309 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 63 7 HYDHYHNIKF 646 

Score = 50.1 bits (118), Expect = 2e-04 

Identities = 55/302 (18%) , Positives = 108/302 (35%) , Gaps = 84/302 (27%) 

Query: 297 I I SRTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWX 352 

+ 1 T G HGDHYH+ +PY + + 
Sbjct: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDAI IS 101 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 406 

+EL Y DIV E Y+4-+ +G Y+ + 

Sbjct: 102 E E L LM KD PN YQL KD S D I VNE I KGG Y V I KVNGK Y Y V YL KD AAHADN 146 

Query: 4 07 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 458 

Q + + +NS A + + G +DGY F+A+ II + +1 

Sbjct: 147 IRTKEEIKRQKQERSHNHNSRADNAVAAARAQG RYTTDDGYIFNASDIIEDTGDAYI 203 

Query: 45 9 MSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPG 507 

+ HG+H HY K +L+ ++ AA+ + + S S 4- P 

Sbjct: 204 VPHGDH YH Y I P KNE L S AS E L AAAE AYWNG KQG SRPSSSSS YNANP AQ PRL S ENHNLT VT P 263 

Query: 508 - -NAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKE KNAI I YPHGDHHHAD P I 559 

+ + + + + +EA4-4-V+ + + 4-4-4- 4- PHG+H4-H P 

Sbjct: 264 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPY 323 
Query: 560 DE 561 

+ 4- 

Sbjct: 324 EQ 325 
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Score = 44.7 bits (104), Expect = 0.009 

Identities = 45/186 (24%), Positives = 63/186 (33%), Gaps = 36/186 (19%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI + AY+ H H H+I K 4- Q L PS 

Sbjct: 529 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 588 

Query: 426 LPINPGISHEKHEE DGYGFDANRIIAEDESGFIMSHGNHNH 466 

+ G+ KE D++ + I+H+HH 

Sbjct: 589 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 647 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y KTE+AK HE SNG+SHQ + + 

Sb j C t : 64 8 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 707 

Query: 517 KKIEEK 522 

K EEK 
Sbjct: 708 KPSEEK 713 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%) , Positives = 51/138 (36%) , Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG + FDP I S + PH H H+I LS E A+ 

Sbjct: 518 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 577 

Query: 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 385 

P + + +T AK E + d +N + V 

Sbjct: 57 8 KEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAK KVPLDRMPYNLQYTV 62 6 

Query: 3 86 E E TAT AY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 627 E VKNGS L 1 1 PH YDH YHN I 644 



tr Q 9 ANY 3 Pneumococcal histidine triad protein B precursor (Fragment) 819 
[phtB] ^ 
[Streptococcus pneumoniae] align 

Score = 423 bits (1087) , Expect = e-116 

Identities = 211/357 (59%), Positives = 271/357 (75%), Gaps = 11/357 (3%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQV 59 

MK +KKY4-A GS +++LS+C+Y L + + + + Q+ K++NRV+Y+DG Q+ QK+ENLTPD+V 
Sbjct : 1 MKINKKYLA-GSVAVLALSVCSYELGRYQAGQDKKESNRVAYIDGDQAGQKAENLTPDEV 59 

Query: 60 SQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIV 119 

S++EGI AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIV 
Sbjct: 60 SKREGINAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIV 119 

Query: 120 NEVKGGYI IKVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD -NEKVNSNVAVARSQG 178 

NE+KGGY+ I KV+GKYYVYLKDAAHADN+RTK+E I RQKQE + N + ++ VA AR+QG 
Sbjct: 120 NE I KGGYVI KVNGKY YVYLKDAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQG 179 
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Query: 179 RYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQ 23 8 

RYTT+DGY+FN +DIIEDTG+AYIVPHG HYHYIP Q s + 

Sbjct: 180 RYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIP- -KNELSASELAAAEAYWNGKQGSR 23 7 

Query: 23 9 LSYSSTASDNNTQ- — SVAKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVF 292 

S SS+ + N Q S TP EN+ SLL+ELY P ++R + ESDGL+F 

Sbjct: 238 PS S S S SYNANPAQPRLS ENHNLTVTPTYHQNQGEN I S SLLRELYAKPLS ERHVE SDGL I F 297 

Query: 293 DPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 349 

DPA+I SRT GVA+PHG+HYHFIPY ++S LE4-4-IAR +P+ + + 4-+P E 

Sbjct: 298 DPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEE 354 



Score = 75.9 bits (185), Expect = 4e-12 

Identities = 110/554 (19%), Positives = 208/554 (36%), Gaps = 93/554 (16%) 

Query: 2 9 RSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEG-IQAEQIVIKITDQGYVTSHGDHY 87 

R ++ + +N S D + +4- 44-4- +■ 4- +G I +1+ T Y-f HGDHY 

Sbjct: 155 RQKQERSHNHNSRADNAVAAARAQG RYTTDDG YI FNASD 1 1 EDTGDAY I VPHGDHY 210 

Query: 88 HYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKDAAHADNV 14 7 

HY +P + L + EL -f + K + Y 4- A + 

Sbjct: 211 HY IPKNELSASELAAAEAYWNGKQGSRPSSSSS YNANPAQP - RL 253 

Query: 148 RTKDE INRQKQEHVKDMEKVNSNVAVARSQG RYTTNDGYVFNPADI IEDTGNAYIVP 204 

+ HE ++S + + 4- R+ +DG 4-F+PA IT VP 

Sbjct: 254 SENHNLTVTPTYHQNQGENIS SLLRELYAKPLS ERHVE SDGL I FDPAQITSRTARGVAVP 313 

Query: 205 HGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANK 2 64 

HG HYH+IP +P++D+4-+T 4-P + 

Sbjct: 314 HGNHYHFIPYEQMSELEKRIA RIIPLRYRSNHWVPDSRPEEPSPQPTPEPS - - 364 

Query: 265 SENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSA- 323 

SP + DG + A + 4- +G +IP LSA 

Sbjct: 365 PSPQPAPSNPIDGKLVKEA- - VRKVGDGYVFEENGVSRYIPAKDLSAE 410 

Query: 324 LEEKIARRVPISGTGSTVSTNAKP NEWXXXXXXXXXXXXXXXXKELSSA 373 

4-+ K4-A+4- +S T T+ N+ + A 

Sbjct: 411 TAAGIDSKLAKQESLSHKLGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEA 470 

Query: 374 SDGYIFNPKDI VEETATAYI - - VRHGDHFHYIPKSNQIGQPTLPNNSLATPSPS 425 

D 4- KD+ + E A+ + 4-RH 4- 4-4G+P N + 
Sbjct: 471 LDNLLERLKDVSSDKVKLVEDILAFLAPIRHPE RLGKP NAQITYTDDE 518 

Query: 426 LPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL 485 

+ + +4-4- EDGY FD I 4 + 4- ++ H H4-H4- K L+E 4- AAQ + 
Sbjct: 519 IQVAK-LAGKYTAEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 577 

Query: 486 EEVKTSHNGLDSLS SHEQDYPGN - - AKEMKDLDKKIE - - EKI AGIMKQYGVKRES I WNK 541 

E GL S4 QD GN AK + 4- + + 4- +K4- Y + 4- V 
Sbjct: 578 XE KGLTPPSTDHQD-SGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQ YTVEV 628 

Query: 542 EKNAI I YPHGDHHH 555 

+ + + I PH DH4-H 
Sbjct: 62 9 KNGSLIIPHYDHYH 642 
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Score =73.9 bits (180), Expect = le-11 

Identities = 66/268 (24%), Positives = 96/268 (35%), Gaps = 85/268 (31%) 

Query: 2 97 1 1 SRTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVP ISGTGST 34 0 

+1 T G HGDHYH+ +PY + + E + + + p I G G 

Sbjct: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDAI - - ISEELLMKDPNYQLKDSDIVNEIKG-GYV 127 

Query: 341 VSTNAK PNEWXXXXXXXXXXXXXXXXKELSSA SDGY 377 

+ N K E + + + +A DGY 

Sbjct: 12 8 IKVNGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGY 187 

Query: 378 I FNP KD I VE E TAT AY I VRHGDH FH Y I P KS N q 408 

I FN DI+E+T AYIV HGDH+HYIPK+ 
Sbjct: 188 I FNAS D I I ED TGD AY I VPHGDH YH Y I P KNEL S AS E LAAAE AYWNG KQG SRPSSSSS YNAN 247 

Query: 409 IGQPTLPNNSLATPSPSLPINPG ISHEKHEEDGYGFDANRIIAEDE 454 

QP L N T + P + N G +S E DG FD +1 + 

Sbjct: 248 PAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTA 307 

Query: 455 SGFIMSHGNHNHYFFKKDLTEEQIKAAQ 4 82 

G + HGNH H+ + ++E + + A+ 
Sbjct: 308 RGVAVPHGNHYHFIPYEQMSELEKRIAR 335 

Score =64.7 bits (156), Expect = 9e-09 

Identities = 48/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 

LKD + +D V+ + + I R + K N ++ + + VA+ G+YT DGY-fF 

Sbjct: 478 LKDVS -SDKVKLVEDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLAGKYTAEDGYIF 536 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+ P DI D G+AY+ PH H H+I 4- P ST + 

Sbjct: 537 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAXEKGLTP PSTDHQD 591 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

+ + AKG+ + +y+ A + D + ++ + + IP 

Sbjct: 592 SGNTEAKGA E AI YNRVKAAKKVPLDRMP YNLQ YTVE VKNGS L IIP 636 

Query: 309 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 63 7 HYDHYHNIKF 646 

Score = 47.4 bits (111), Expect = 0.001 

Identities = 47/255 (18%), Positives = 92/255 (35%), Gaps = 77/255 (30%) 

Query: 384 I VE E TAT AY I VRHGD H F H Y 1 PKSNQIGQPTL PNNSL 419 

+ + + T Y+ HGDH+HY +P I + L PN L 

Sbjct: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKV 130 

Query: 42 0 ATPSPSLPINPGISHEKHE EDGYGFD 445 

A + ++ I +K E +DGY F+ 

Sbjct: 131 NGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFN 190 

Query: 446 ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDY 505 

A+ II + +1+ HG+H HY K +L+ ++ AA+ + + S S + 

Sbjct: 191 ASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQ 250 
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Query: 506 PG NAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI 546 

P + + + + + +EA++V+++ +++ + 

Sbjct: 251 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGV 310 

Query: 547 IYPHGDHHHADPIDE 561 

PHG+H+H P + 4- 
Sbjct: 311 AVPHGNHYHFIPYEQ 325 



Score = 45.8 bits (107), Expect = 0.004 

Identities = 45/185 (24%) , Positives = 65/185 (34%) , Gaps = 34/185 (18%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLA TPSPSL 426 

+A DGYIF+P+DX + AY+ H H H+I K + + A TP + 

Sbjct: 529 TAEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAXEKGLTPPSTD 588 

Query: 427 PINPGISHEKHEE DG YG FD ANR 1 1 AED E S G F I M S HGNHNH 466 

+ G + K E D + + + 1+ H +H H 

Sbjct: 589 HQDSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEW 648 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDK 517 

Y KTE+AK HE SNG-fSHQ + 
Sbjct: 64 9 FDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTEK 708 

Query: 518 KIEEK 522 

EEK 

Sbjct: 709 PSEEK 713 



Score = 40.0 bits (92), Expect = 0.23 

Identities = 32/130 (24%), Positives = 49/130 (37%), Gaps = 3/130 (2%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVP 333 

E+ + A +Y+ DG +FDP IS 4- PH H H+I LS E A+ 
Sbjct: 518 E I QVAKLAGKYTAEDG YI FDPRD I TSDEGDAYVTPHMTHSHWI KKDS LS E AERAAAQAYA 577 

Query: 334 ISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYI 393 

+ ST+ + + K D+N+VE+I 
Sbjct: 57 8 XE KGLTP P S TDHQD SGNTE AKGAE AI YNRVKAAKK VPLDRMP YNLQYTVE VKNGS L I 634 

Query: 3 94 VRHGDHFHYI 4 03 

+ H DH+H I 
Sbjct: 63 5 IPHYDHYHNI 644 



tr Q6WNQ3 Surface protein BVH-11 (Fragment) [bvh-11] [Streptococcus 811 AA 
pneumoniae] 

align 

Score = 417 bits (1071) , Expect = e-115 

Identities = 208/337 (61%), Positives = 256/337 (75%), Gaps = 10/337 (2%) 

Query : 2 1 CAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGY 7 9 

C+Y L +H++ Q K++NRVSY+DG Q+ QK+EMLTPD+VS++EGI AEQIVIKITDQGY 
Sbj Ct : 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 
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Query: 8 0 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLK 13 9 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKVDGKYYVYLK 
Sbjct: 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLK 120 

Query: 14 0 DAAHADNVRTKDEINRQKQEHVKD-NEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTG 198 

DAAHADN+RTK+E I RQKQE + N + + + VA AR+QGRYTT+DGY+FN +DIIEDTG 
Sbjct: 121 DAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTG 180 

Query: 199 NAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ SVAK 255 

+AYIVPHG HYHYIP Q S+ S SS+ + N Q S 

Sbjct: 181 DAYIVPHGDHYHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENH 238 

Query: 256 GSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDH 312 

TP N+ EN+ SLL+ELY P ++R+ ESDGL+FDPA+I SRT NGVA+PHGDH 
Sbjct: 239 NLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDH 298 

Query: 313 YHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 34 9 

YHFIPYS+LS LEEK+AR +P + + + 

Sbjct: 299 YHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQ 335 

Score = 84.3 bits (207), Expect = le-14 

Identities = 90/385 (23%), Positives - 134/385 (34%), Gaps = 73/385 (18%) 

Query: 193 I IEDTGNAYIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNM 234 

+1+ T Y+ HG HYHY P + 

Sbjct : 52 VI KI TDQGYVTSHGDHYHYYNGKVP YDAI I SEELLMKDPNYQLKDSD I VNE I KGGYVI KV 111 

Query: 235 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 2 93 

Y A+ + + + +S N S + AQ RY+ DG +F + 

Sbjct: 112 DGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFN 171 

Query: 2 94 PAKI I SRTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGSTVST NAKP-- 347 

+ 11 T + +PHGDHYH+IP S LSA E A+ GS S + NA P 

Sbjct: 172 ASDIIEDTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQ 231 

Query: 34 8 NEWXXXXXXXXXXXXXXXXKELSSA SDGYI FNPKD I VEETATAY 3 92 

N V +EL + SDG IF+P I TA 

Sbjct: 232 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGV 291 

Query: 393 IVRHGDHFHYIPKS NQIGQPTLPNNSLATPSPSLP 427 

V HGDH+H+IP S ++ GP+ + +PSP 

Sbjct: 292 AVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPA 351 

Query: 428 INPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL-E 486 

NP + ++ +A R + + G+ + Y KDL+ E L + 

Sbjct: 352 PNPQPAPSNP IDEKLVKEAVRKVGD GYVFEENGVPRYIPAKDLSAETAAGIDSKLAK 408 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ SH L++DP++E 
Sbjct: 409 QESLSH KLGAKKTDLPSSDRE 429 

Score = 78.2 bits (191), Expect = 8e-13 

Identities = 112/541 (20%), Positives = 205/541 (37%), Gaps = 61/541 (11%) 
Query: 2 9 RSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEG- IQAEQIVIKITDQGYVTSHGDHY 87 
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r ++ + + N S D + ++ +++ + + +G I +1+ T Y+ HGDHY 

Sbjct: 136 RQKQERSHNHNSRADNAVAAARAQG RYTTDDGYIFNASDIIEDTGDAYIVPHGDHY 191 

Query: 88 HYYNGK^PYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKDAAHADNV 147 

HY +P L + EL + K + + + N+ 

Sbjct: 192 HY 1 PKSDLS ASELAAAQAYWNGKQGSRPS S S S SHNANPAQPRL SENHNL 240 

Query: 14 8 RTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGG 2 07 

++ + E++ + + A S+ R+ +DG + F+PA I T N VPHG 
Sbjct: 241 TVTPTYHQNQGENI - -SSLLRELYAKPLSE-RHVESDGLIFDPAQITSRTANGVAVPHGD 297 

Query: 208 HYHYIPXXXXXXXXXXXXXXXXXXX NMQPSQLSYSSTASDNNTQSVAKGSTSK 260 

HYH+IP + +P Q S ST + + A 

Sbjct: 298 HYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPA 357 

Query: 261 PANKSENLQSLLKELYDSPSAQRYSESDGLV-FDPAKIIS -RTPNGVAIPHGDHYHFIPY 318 

P+N + + L+KE E +G+ + PAK +S T G+ 
Sbjct: 358 PSNPID- -EKLVKEAVRKVGDGYVFEENGVPRYIPAKDLSAETAAGID 403 

Query: 319 SKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYI 3 78 

SKL A +E + + T S N+ + A D + 

Sbjct: 404 SKL-AKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLL 462 

Query: 379 FNPKD I VEETATAYIVRHGDHFHYI PKSNQIGQPTLPNNSLATPS PSLPINPGI SHEKHE 43 8 

KD+ + D ++ + PN + + + + 

Sbjct: 463 ERLKDVSSDKVKLV DD I L AFL AP I RHPERLGKPNAQ I T YTDDE I QVAK - L AGKYTT 517 

Query: 439 EDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSL 498 

EDGY FD I +++ ++ H H+H+ K L+E + AAQ + +E GL 
Sbjct: 518 EDGY I FDPRD I T SDEGDAYVTPHMTHSHW I KKD S L S E AERAAAQAYAKE KGLTPP 572 

Query: 499 SSHEQDYPGN- -AKEMKDLDKKIE- -EKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHH 554 

S+ QD GN AK -f + + + + +K+ Y ++ V + + + I PH DH+ 

Sbjct: 573 STDHQD - SGNTEAKGAEAI YNRVKAAKKVPLDRMPYNLQ YTVEVKNGSLI I PHYDHY 628 

Query: 555 H 555 

H 

Sbjct: 629 H 629 



Score = 68.2 bits (165), Expect = 8e-10 

Identities = 49/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARS QGRYTTNDGYVF 188 

LKD + +D V+ D + I R + K N ++ + + VA+ G+YTT DGY+F 

Sbjct: 465 LKDVS -SDKVKLVDD I LAFLAP I RHPERLGKPNAQ I T YTDDE I QVAKLAGKYTTEDGY IF 523 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 524 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP P STDHQD 578 

Query: 24 9 NTQS VAKGSTSKPANKSENLQSLLKELYDS PS AQRYSESDGLVFDPAKI I SRTPNGVAI P 3 08 

■+ + AKG+ + +Y+ A + D + + + IP 

Sbjct: 579 SGNTEAKGA E A I YNR VKAAKKVPLDRMP YNLQ YT VE VKNGS L 1 1 P 623 

Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 624 HYDHYHNIKF 633 
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Score = 58.2 bits (139), Expect = 8e-07 

Identities = 60/306 (19%), Positives = 112/306 (35%), Gaps = 84/306 (27%) 

Query: 297 IISRTPNGVAIPHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWX 352 

+ 1 T G HGDHYH+ + PY + + 
Sbjct: 52 VI KI TDQG YVTSHGDHYHYYNGKVP YDAI I S 82 

Query: 3 53 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 4 06 

+EL Y DIV E Y+++ G ++ Y+ + 

Sbjct: 83 EELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADN 127 

Query: 4 07 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 45 8 

Q + + +NS A + + G +DGY F+A+ II + +1 

Sbjct: 128 IRTKEEIKRQKQERSHNHNSRADNAVAAARAQG R YTTDDG Y I FNASD I I EDTGDAY I 184 

Query: 459 MSHGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSHNGLDSLSSHEQDY 505 

+ HG+H HY K DL+ AAQ + +SHN + + + + 

Sbjct: 185 VPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTP 244 

Query: 5 06 PGNAKEMKDLDKKIEEKIAGIMKQYGVKRES I VWKEKNAIIYPHGDHHHADPI 55 9 

+ + + + + + + +-f N + PHGDH+H P 

Sbjct: 245 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPY 304 

Query: 56 0 DEHKPV 565 

+ P+ 

Sbjct: 305 SQLSPL 310 



Score =42.7 bits (99), Expect = 0.036 

Identities = 44/186 (23%), Positives = 63/186 (33%), Gaps = 36/186 (19%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI + AY+ H H H+I K + Q L PS 

Sbjct: 516 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 575 

Query: 42 6 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+ G + K E D++ + I+H+HH 

Sbjct: 576 HQ - D S GNTE AKGAE A I YNR VKAAKKVPLDRMP YNLQ YT VE VKNG S L I I PHYDH YHN I KFE 634 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K+E + AK HE SNG+SHQ + + 

Sbjct: 635 WFDEGLYEAPKGYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 694 

Query: 517 KKIEEK 522 

K EEK 
Sbjct: 695 KPNEEK 700 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 2 74 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 3 32 

E+ + A +Y+ DG 4-FDP IS + PH H H+I LS E A+ 
Sbjct: 505 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 564 

Query : 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 
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P + + +T AK E + d +N + V 

Sbjct: 565 KE KGL T P P S TDHQD S GNTE AKG AE A I YNR VKAAK KVPLDRMPYNLQYTV 613 

Query: 3 86 E E TAT AY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 614 EVKNGSLIIPHYDHYHNI 631 



tr Q6WNP8 Surface protein BVH-11-2 (Fragment) [bvh-11-2] [Streptococcus 834 AA 
pneumoniae] 

align 



Score = 417 bits (1071) , Expect = e-115 

Identities = 208/337 (61%) , Positives = 256/337 (75%) , Gaps = 10/337 (2%) 

Query: 21 CAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGY 79 

C+Y L +H++ Q K++NRVSY+DG Q+ QK+ENLTPD+VS-h-hEGI AEQIVIKITDQGY 
Sbjct : 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

Query: 8 0 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLK 13 9 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKVDGKYYVYLK 
Sbjct : 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLK 120 

Query: 140 DAAHADITVRTKDEINRQKQEHVKD-NEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTG 198 

DAAHADN+RTK+EI RQKQE + N + +4- VA AR+QGRYTT+DGY+FN +DIIEDTG 
Sbjct: 121 DAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYI FNASD I I EDTG 180 

Query: 199 NAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ- - -SVAK 255 

+AYIVPHG HYHYIP Q S+ S SS + + N Q S 

Sbjct: 181 DAYIVPHGDHYHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENH 238 

Query: 256 GSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDH 312 

TP N+ EN+ SLL+ELY P ++R+ ESDGL-f FDPA+I SRT NGVA+PHGDH 
Sbjct: 239 NLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDH 298 

Query: 313 YHF I PYS KLS ALEEKI ARRVP I SGTGSTVS TNAKPNE 349 

YHFlPYS-fLS LEEK+AR +P+ + ++4-P + 

Sbjct: 299 YHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQ 335 



Score = 84.3 bits (207), Expect = le-14 

Identities = 90/385 (23%), Positives = 134/385 (34%), Gaps = 73/385 (18%) 

Query: 193 IIEDTGNAYIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNM 234 

+1+ T Y+ HG HYHY P + 

Sbjct : 52 VIKITDQGYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKV 111 

Query: 235 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 293 

Y A+ + + + +S N S + AQ RY+ DG +F+ 

Sbjct: 112 DGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFN 171 

Query: 294 PAKI ISRTPNGVAIPHGDHYHFIPYSKLS ALEEKI ARRVPISGTGSTVST NAKP-- 347 

+11 T + +PHGDHYH+IP S LSA E A+ GS S+ NA P 

Sbjct: 172 ASDIIEDTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQ 231 

Query: 34 8 NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNPKDIVEETATAY 3 92 
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N V -f EL + SDG IF+P I TA 

Sbjct: 232 PRLSENHNLTVTPTYHQMQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGV 291 

Query: 393 I VRHGDHFHYI PKS NQIGQPTLPNNSLATPSPSLP 427 

V HGDH+H+IP S 44 QP+ + +PSP 

Sbjct: 292 AVPHGDHYHFIPYSQLSPLEEKLARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPA 351 

Query: 428 INPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL-E 486 

NP 4 4+ +A R + + G++ Y KDL+ E L 4 

Sbjct: 352 PNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGVPRYIPAKDLSAETAAGIDSKLAK 408 

Query: 4 87 EVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ SH L++DP++E 
Sbjct: 40 9 QESLSH KLGAKKTDLPSSDRE 42 9 



Score = 68.2 bits (165), Expect = 8e-10 

Identities = 49/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 18 8 

LKD 4 +D V+ D+I R 4 K N 44 4 4 VA+ G+YTT DGY+F 

Sbjct: 465 LKD VS - SD KVKL VDD I L AFLAP I RHPERLGKPNAQ I T YTDDE I Q VAKL AGKYTTEDG Y I F 523 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G4AY4 PH H H+I + P ST 4 

Sbjct: 524 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 578 

Query: 24 9 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 3 08 

+ 4 AKG+ 4- +Y+ A + D + +4- 4- 4- IP 

Sbjct: 579 SGNTEAKGA EAI YNRVKAAKKVPLDRMPYNLQYTVE VKNGSL 1 1 P 623 

Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 624 HYDHYHNIKF 63 3 

Score = 58.2 bits (139), Expect = 8e-07 

Identities = 60/306 (19%), Positives = 112/306 (35%), Gaps = 84/306 (27%) 

Query: 2 97 IISRTPNGVAIPHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGS TVS TNAKPNE WX 3 52 

+ 1 T G HGDHYH+ +PY 4 4 
Sbjct: 52 VIKITDQGYVTSHGDHYHYYNGKVPYDAI IS 82 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 406 

+EL Y DIV E Y444 G 44 Y4 4 

Sbjct: 83 EELLMKDPNYQLKDSD I VNE I KGGYVI KVDGKY YVYLKDAAHADN 127 

Query: 407 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 458 

Q 4 4 43SFS A 4 4 G 4DGY F+A4 II 4 +1 

Sbjct: 128 IRTKEEIKRQKQERSHNHNSRADNAVAAARAQG R YTTDDG Y I FNAS D I I EDTGD AY I 184 

Query: 459 MSHGNHNHYFFKKDLTEEQIKAAQKHLE E VKTSHNGLDS LS SHEQD Y 505 

+ HG4H HY K DL4 ++ AAQ 4 + SHN 4 4-44 

Sbjct: 185 VPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTP 244 

Query: 5 06 PGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAIIYPHGDHHHADPI 559 

4- + + + + 4EA44V444 4 44 N4 PHGDH4H P 

Sbjct: 245 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTANGVAVPHGDHYHFIPY 304 
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Query: 560 DEHKPV 565 
+ P+ 

Sbjct: 305 SQLSPL 310 



Score = 47.4 bits (111), Expect = 0.001 

Identities = 61/260 (23%), Positives = 90/260 (34%), Gaps = 51/260 (19%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI + AY+ H H H+I K 4- Q L PS 

Sbjct: 516 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 575 

Query: 426 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+G+KE D++ + I+H+HH 

Sbjct: 576 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 634 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K + E + A K HE S + S Jl Q + + 

Sbjct: 635 WFDEGLYEAPKGYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 694 

Query: 517 KKIEEKIAGIMKQYGVKRES I WNKEKNAI I YPHGDHHHADPIDEHKPVGIGHSHSNYEL 576 

K EEK + E +KE + + P H +D + H VG+ S N L 

Sbjct: 695 KPNEEK PQTEKPEEDKEHDEVSEP- -THPESDEKENH- -VGLNPSADN- -L 73 9 

Query: 577 FKPEEGVAKKEGNKVYTGEE 5 96 

+KP + E T +E 

Sbjct: 74 0 YKPSTDTEETEEEAEDTTDE 75 9 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP IS + PH H H+I LS E A+ 
Sbjct: 505 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 564 

Query: 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 385 

P + + +T AK E + D +N + V 

Sbjct: 565 KE KGL T P P S TDHQD S GNTE AKG AE A I YNR VKAAK KVPLDRMPYNLQYTV 613 

Query: 386 E E TAT AY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 614 E VKNGS L 1 1 PH YDH YHNI 631 



tr Q6WNQ1 Surface protein BVH-11 (Fragment) [bvh-11] [Streptococcus 825 AA 
pneumoniae] 

align 

Score = 415 bits (1066) , Expect = e-114 

Identities - 206/350 (58%), Positives = 257/350 (72%), Gaps = 21/350 (6%) 

Query: 21 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 80 

C+Y L ++++ K+NNRVSY+DG Q++QK+ENLTPD+VS++EGI AEQIVIKITDQGYV 
Sbjct : 1 CSYELGLYQARPVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYV 60 
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Query : 81 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 14 0 

TSHGDHYHYYNGKVPYDA+FSEELLMKDPNY+LKD DIVNEVKGGY+IKVDGKYYVYLKD 
Sbjct: 61 TSHGDHYHYYNGK^PYDAIFSEELLMKDPNYKLKDEDIWEVKGGYVIKVDGKYYVYLKD 12 0 

Query: 141 AAHADNVRTKDEINRQKQEHVKDNE KVNSNVAVARSQGRYTTNDGYVFNPADIIED 196 

AAHADNVRTK+EINRQKQEH 4- E + + VA+ARSQGRYTT+DGY+FN +DIIED 

Sbjct: 121 AAHADNVRTKEE I NRQKQEHS QHREGGTPRNDGAVALARS QGR YTTDDG Y I FNAS D I I ED 180 

Query: 197 TGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 24 8 

TG+AYIVPHG HYHYIP + +Y SDN 

Sbjct: 181 TGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRLQNSDNTPRTNWVP 240 

Query: 249 NTQSVAKGSTSKPANKS-ENLQSLLKELYDSPSAQRYSESDGLVFDPAKIIS 299 

NT + +T+ A+ + S E-H+ SLLK+LY P +QR+ ESDGL+FDPA+I S 

Sbjct: 241 SVSNPGTTNTNTSNNSNTNSQASQSNEDVDSLLKQLYALPLSQRHVESDGLIFDPAQITS 300 

Query: 300 RTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGSTVS TNAKPNE 349 

RT GVA+ PHG+HYHF I P Y + + S LEE + IAR + P+ + + + + P + 

Sbjct: 301 RTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQ 350 



Score = 73.2 bits (178), Expect = 2e-ll 

Identities = 73/288 (25%), Positives = 107/288 (36%), Gaps = 63/288 (21%) 

Query: 2 80 S AQR YS ESDGLVFDPAKI IS RTPNGVAI PHGDHYHF I PYS KLS ALEEKI ARR 331 

S RY+ DG +F+ +11 T + +PHGDHYH+IP ++LSA E A 
Sbjct: 160 SQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNL 219 

Query: 332 VP-ISGTGST VSTNAKPNEWXXXXXXXXXXXXXXXXK 368 

VP +S G+T S N+ N 
Sbjct: 22 0 SNSRTYRLQNSDNTPRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNEDVDSLLKQLYAL 279 

Query: 369 ELSSA SDGYIFNPKDIVEETATAYIVRHGDHFHYIP KS 406 

LS SDG IF + P I TA V HG-fH+H+IP +S 

Sbjct: 280 PLSQRHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRS 339 

Query: 407 NQIGQPTLPNNSLATPSPS - -LPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNH 464 

N + P PSPS NP + ++ +A R + + G++ 
Sbjct: 340 NHWVPDSRPEQPSPQPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGV 396 

Query: 465 NHYFFKKDLTEEQIKAAQKHL - EEVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ Y KDL+ E L + + SH L + + D P + +E 

Sbjct: 397 SRYIPAKDLSAETAAGIDSKLAKQESLSH KLGTKKTDLPSSDRE 440 



Score = 67.0 bits (162), Expect = 2e-09 

Identities = 48/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 

LKD + +D V+ ++I R + K N + + + + VA+ G+YTT DGY+F 

Sbjct: 476 L KDVS - SDKVKLVED I LAFLAP I RHPERLGKPNS Q I T YTDDE I QVAKLAGKYTTEDG Y I F 534 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G+AY+ PH H H+I + p ST + 

Sbjct: 535 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 589 

Query: 24 9 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 3 08 
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4- 4-Y+ A + D + + 4- 4- 4- IP 

EAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IP 634 

Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 635 HYDHYHNIKF 644 



4- 4- AKG+ 
Sbjct: 590 SGNTEAKGA 



Score =53.1 bits (126), Expect = 3e-05 

Identities = 57/315 (18%), Positives = 108/315 (34%), Gaps = 94/315 (29%) 

Query: 297 IISRTPNGVAIPHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WX 352 

+ 1 T G HGDHYH+ +PY 4- + 
Sbjct: 51 VIKITDQGYVTSHGDHYHYYNGKVPYDAIFS 81 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR HGDH 399 

+EL Y +DIV E Y+44 H D+ 

Sbjct: 82 EELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADN 126 

Query: 400 FHYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFI 458 

+ N+ Q + TP + S +4- 4-DGY F+A+ II 4- 4-1 

Sbjct: 127 VRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYI 186 

Query: 459 MSHGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSH 4 92 

4- HG+H HY K +L4 4-4 AA+ L V 
Sbjct: 187 VPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRLQNSDNTPRTNWVPSVSNPG 246 

Query: 4 93 NGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI 54 6 

+ S4-+ ++ 4D4D 4-4- + A 4- Q V4- 4 4~ 4-4-+ + 

Sbjct: 247 TTNTNTSNNSNTNSQASQSNEDVDSLLKQLYALPLSQRHVESDGLIFDPAQITSRTARGV 3 06 

Query: 547 IYPHGDHHHADPIDE 561 

PHG+H4-H P 4-4- 
Sbjct: 3 07 AVPHGNHYHF I P YEQ 321 



Score =43.9 bits (102), Expect = 0.016 

Identities = 62/286 (21%), Positives = 95/286 (32%), Gaps = 43/286 (15%) 

Query: 372 S ASDGY I FNPKD I VEETATAYI VRHGDHFH YI PKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF + P+DI 4- AY+ H H H+I K + Q L PS 

Sbjct: 527 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 586 

Query: 426 LPINPGISHEKHEE DGYGFDANRIIAEDESGFIMSHGNHNH 466 

4 G + K E D44- + I4-H+HH 

Sbjct: 587 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 645 

Query: 4 67 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K T E 4 A K HE S NG 4 S H 4 4 DD 

Sbjct: 646 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNKADQDSKPDED 705 

Query: 517 KKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHA DPIDEHKPVGIGHSH 571 

K 4E 4- K 4-N 4 4- P D D DE 4 4- HS 

Sbjct: 706 KGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQVEHSV 765 

Query: 572 SNYELFKPEEGVAKKEGNKVYTG- -EELTNWNLLKNSTFNNQNFT 615 

N 4-4 E + K 4- E LT + + L T +N + 

Sbjct: 766 INAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTIS 811 
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Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSKLS ALEEKIARR V- 332 

E+ + A +Y+ DG 4-FDP IS + PH H H+I LS E A+ 

Sbjct: 516 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 575 

Query: 33 3 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + D +N + V 

Sbjct: 576 KEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAK KVPLDRMPYNLQYTV 624 

Query: 3 86 EETATAYIVRHGDHFHYI 4 03 

E + 1+ H DH+H I 

Sbjct: 625 EVKNGSLI IPHYDHYHNI 642 



tr Q6WNP5 Surface protein BVH-11-2 (Fragment) [bvh-11-2] [Streptococcus 830 AA 
pneumoniae] 

alig n 

Score = 415 bits (1066), Expect = e-114 

Identities = 206/350 (58%), Positives = 257/350 (72%), Gaps - 21/350 (6%) 

Query: 21 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 80 

C+Y L ++++ K+NNRVSY+DG Q++QK+ENLTPD+VS++EGI AEQIVIKITDQGYV 
Sbjct : 1 CSYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYV 60 

Query: 81 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 14 0 

TSHGDHYHYYNGKVPYDA+FSEELLMKDPNY+LKD D I VNEVKGGY+ IKVDGKYYVYLKD 
Sbjct: 61 TSHGDHYHYYNGKVPYDAIFSEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKD 120 

Query: 141 AAHADNVRTKDE INRQKQEHVKDNE KVNSNVAVARS QGR YTTNDG YVFNP AD 1 1 ED 196 

AAHADNVRTK+E INRQKQEH + E + + VA+ARSQGRYTT+DGY+FN +DIIED 

Sbjct: 121 AAHADNVRTKEEINRQKQEHSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIED 180 

Query: 197 TGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

TG+AYIVPHG HYHYIP + +Y SDN 

Sbjct: 181 TGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRLQNSDNTPRTNWVP 240 

Query: 249 NTQSVAKGSTSKPANKS - ENLQSLLKELYDS PSAQRYSESDGLVFDPAKI IS 299 

NT + +T+ A+ + S E++ SLLK+LY P +QR+ ESDGL+FDPA+I S 
Sbjct: 241 SVSNPGTTNTNTSNNSNTNSQASQSNEDVDSLLKQLYALPLSQRHVESDGLIFDPAQITS 300 

Query: 300 RTPNGVAI PHGDHYHF I P YSKLS ALEEKI ARRVP I SGTGS TVSTNAKPNE 349 

RT GVA+PHG+HYHFIPY ++S LEE+IAR + P+- + + + +P + 

Sbjct: 301 RTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQ 350 

Score = 73.2 bits (178), Expect = 2e-ll 

Identities = 73/288 (25%), Positives = 107/288 (36%), Gaps = 63/288 (21%) 

Query: 280 SAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I PYSKLSALEEKIARR 331 

S RY+ DG + F+ +11 T + + PHGDH YH+ I P ++LSA E A 
Sbjct: 160 SQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRGNL 219 
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Query: 332 VP - ISGTGST VSTNAKPNEWXXXXXXXXXXXXXXXXK 368 

VP +S G+T S N+ N 
Sbjct: 220 SNSRTYRLQNSDNTPRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNEDVDSLLKQLYAL 279 

Query: 369 ELSSA SDGYIFNPKDIVEETATAYIVRHGDHFHYIP KS 406 

LS SDG IP+P I TA V HG+H+H+IP +S 

Sbjct: 280 PLSQRHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRS 339 

Query: 407 NQIGQPTLPNNSLATPSPS - -LPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNH 464 

N + P PSPS NP 4 ++ +A R + + G++ 
Sbjct: 340 NHWVPDSRPEQPSPQPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGV 396 

Query: 465 NHYFFKKDLTEEQIKAAQKHL-EEVKTSHNGLDSLSSHEQDYPGNAKE 511 

4 Y KDL+ E L 44 SH L 4 + D P 4 +E 

Sbjct: 397 SRYIPAKDLSAETAAGIDSKLAKQESLSH KLGTKKTDLPSSDRE 440 

Score = 67.0 bits (162), Expect = 2e-09 

Identities = 48/190 (25%), Positives = 80/190 (41%), Gaps = 30/190 (15%) 

Query: 138 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 188 

LKD 4 +D V+ + + I R + K N 4-4- + + VA4 G-fYTT DGY4F 

Sbjct: 476 LKDVS-SDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIF 534 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 24 8 

4P DI D G+AY-f PH H H+I + p ST 4 

Sbjct: 535 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 589 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

4 + AKG+ + +Y+ A + D + ++ + + IP 

Sbjct: 590 SGNTEAKGA EAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IP 634 

Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 635 HYDHYHNIKF 644 

Score = 55.5 bits (132), Expect = 5e-06 

Identities = 69/373 (18%), Positives = 129/373 (34%), Gaps = 108/373 (28%) 

Query: 239 LSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKII 298 

L + T +NN S G + K+ENL 4- S+ 4G4 4 +1 

Sbjct: 7 LYQARTVKENNRVSYIDGKQA- -TQKTENL TPDEVSKREGINAEQI -VI 52 

Query: 299 SRTPNGVAIPHGDHYHF 1 P YSKLS ALEEKI ARRVP I S GTGSTVSTNAKPNE WXXX 354 

T G HGDHYH4 4PY 4 4 
Sbjct: 53 K I TD QG YVT S HGDH YH Y YNG KVP YDA I F S 81 

Query: 3 55 XXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR HGDHFH 4 01 

+EL Y 4DIV E Y444 H D4 

Sbjct: 82 EELLMKDPNYKLKDEDIWEVKGGYVIKVLGKYYVYLKDAAHADNVR 128 

Query: 402 YIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHE-EDGYGFDANRIIAEDESGFIMS 460 

4 N4 Q 4 TP 4 S 44- 4DGY F4A+ II 4 +1 + 

Sbjct: 129 TKEE INRQKQEHS QHREGGTPRNDGAVALARS QGRYTTDDG YI FNASD 1 1 EDTGDAY I VP 188 



Query: 461 HGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSHNG 4 
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HG+H HY K +L+ ++ AA+ L V 
Sbjct: 189 HGDHYHYIPKNELSASELAAAEAFLSGRGNLSNSRTYRLQNSDNTPRTNWVPSVSNPGTT 24 8 

Query: 495 LDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI I Y 54 8 

+ S++ ++ +D+D + + + A + Q V+ + + + + + + 

Sbjct: 249 NTNTSNNSNTNSQASQSNEDVDSLLKQLYALPLSQRHVESDGLIFDPAQITSRTARGVAV 308 

Query: 54 9 PHGDHHHAD P I DE 561 

PHG+H+H P ++ 
Sbjct: 309 PHGNHYHFIPYEQ 321 



Score = 43.9 bits (102), Expect = 0.016 

Identities = 62/286 (21%), Positives = 95/286 (32%), Gaps = 43/286 (15%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF + P-fDI + AY+ H H H+I K + Q L PS 

Sbjct: 527 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 586 

Query: 42 6 LPINPGISHEKHEE DG YG FDANR 1 1 AED E S GF I MS HGNHNH 4 66 

+ G + K E D + 1+ H +H H 

Sbjct: 587 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 645 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K T E + A K HE S NG + SH+ + DD 

Sbjct: 646 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNKADQDSKPDED 705 

Query: 517 KKIEEKIAGIMKQYGVKRES I WNKEKNAI I YPHGDHHHA DPIDEHKPVGIGHSH 571 

K +E + K +N + + P D D DE + + HS 

Sbjct: 706 KGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAE I PQVEHSV 765 

Query: 572 SNYELFKPEEGVAKKEGNKVYTG- -EELTNWNLLKNSTFNNQNFT 615 

N++ E+K + ELT + + L T+N + 

Sbjct: 766 INAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTIS 811 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP IS + PH H H+I LS E A+ 
Sbjct: 516 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 575 

Query: 33 3 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + D +N + V 

Sbjct: 576 KE KGL T P P S TDHQD S GNTE AKGAE A I YNR VKAAK KVPLDRMPYNLQYTV 624 

Query: 3 86 EETATAYIVRHGDHFHYI 4 03 

E + 1+ H DH+H I 
Sbjct: 625 EVKNGSLIIPHYDHYHNI 642 



tr Q6WNP9 Surface protein BVH-11-2 (Fragment) [Streptococcus pneumoniae] 819 AA 

align 
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Score = 410 bits (1054), Expect = e-113 

Identities = 204/337 (60%), Positives = 255/337 (75%), Gaps = 10/337 (2%) 

Query: 21 CAYALNQHRS-QENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGY 7 9 

C+Y L +H+ + Q K++NRVSY+DG Q+ QK+ENLTPD+VS++EG1 AEQIVIKITDQGY 
Sbjct: 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

Query: 80 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLK 139 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKVDGKYYVYLK 
Sbjct : 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKVXJGKYYVYLK 120 

Query: 140 DAAHADNVRTKDE INRQKQEHVKD - NEKVKTSNVAVARS QGRYTTNDG YVFNPAD I 1 EDTG 198 

DAAHADN+RTK+EI RQKQE + N + ++ VA AR+QGRYTT+DGY+FN +DIIEDTG 
Sbjct: 121 DAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTG 180 

Query: 199 NAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ- - -SVAK 255 

+AYIVPHG H+HYIP Q S + S SS+ + N Q S 

Sbjct: 181 DAYIVPHGNHFHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENH 238 

Query: 256 GSTSKPA- - -NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDH 312 

TP N+ EN+ SLL+ELY P ++R+ ESDGL+FDPA+I SRT GVA+PHG+H 
Sbjct: 239 NLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNH 2 98 

Query: 313 YHF I P YSKLS ALEEKIARRVP I SGTGSTVS TNAKPNE 349 

YHFIPYS++S LEE+IAR +P + + + ++P + 
Sbjct: 299 YHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQ 335 



Score =77.8 bits (190), Expect = le-12 

Identities = 86/385 (22%), Positives = 135/385 (34%), Gaps = 73/385 (18%) 

Query: 193 I IEDTGNAYIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNM 234 

+ 1 + T Y+ HG HYHY p + 

Sbjct: 52 VI KITDQG YVTSHGDHYH YYNGKVP YDAI I SEELLMKDPNYQLKDSD I VNE I KGGYVI KV 111 

Query: 235 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 293 

Y A+ + + + + s N S + AQ RY+ DG + F+ 

Sbjct: 112 DGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFN 171 

Query: 294 PAKI I SRTPNGVAIPHGDH YHF IP YSKLS ALEEKIARRVP I SGTGS TVS T NAKP-- 347 

+ 11 T -f +PHG+H+H+IP s LSA E A+ GS S+ WA P 

Sbjct: 172 ASDIIEDTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQ 231 

Query: 348 NEWXXXXXXXXXXXXXXXXKELSSA SDGYI FNPKDI VEETATAY 392 

N V +EL + SDG IF+P I TA 

Sbjct: 232 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGV 291 

Query: 3 93 IVRHGDHFHYIPKS NQIGQPTLPNNSLATPSPSLP 42 7 

V HG+H+H+IP S ++ QP+ + +psp 

Sbjct: 292 AVPHGNHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQSA 351 

Query: 428 INPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL-E 486 

NP + + + +A R + + G++ + Y K+L+ E L + 

Sbjct: 352 PNPQPAPSNPIDEKLVKEAVRKVGD GYVFEKNGVSRYIPAKNLSAETAAGIDSKLAK 408 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ SH L++DP++E 
Sbjct: 409 QESLSH KLGAKKTDLPSSDRE 429 
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Score =73.6 bits (179), Expect = 2e-ll 

Identities = 110/550 (20%), Positives = 205/550 (37%), Gaps = 79/550 (14%) 

Query: 29 RSQENKDNNRVS YVDGSQSSQKSENLTPDQVSQKEG - IQAEQIVIKITDQGYVTSHGDHY 87 

R 44- + +N S D + ++ 4-4-4 + + +G I 4-1+ T Y+ HG+H+ 

Sbjct: 136 RQKQERSHNHNSRADNAVAAARAQG RYTTDDGYIFNASDI IEDTGDAYIVPHGNHF 191 

Query: 8 8 HYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKV1DGKYYVYLKDAAHADW 14 7 

HY +P L + EL 4 K 4 + + M4- 

Sbjct: 192 HY 1 PKSDLS ASELAAAQAYWNGKQGSRPS S SS SHNANPAQPRL SENHNL 240 

Query: 148 RTKDE INRQKQEHVKDNEKVNSNVAVARS QGRYTTNDG YVFNPAD I 1 EDTGNAYI VPHGG 207 

44- 4 E + + + 4 A S4- R+ +DG +F+PA I T VPHG 
Sbjct: 241 TVTPTYHQNQGENI--SSLLRELYAKPLSE-RHVESDGLIFDPAQITSRTARGVAVPHGN 297 

Query: 208 HYHYIPXXXXXXXXXXXXXXXXXXX NMQPSQLSYSSTASDNNTQSVAKGSTSK 260 

HYH+IP + +P Q S ST + + A 

Sbjct: 298 HYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQSAPNPQPA 357 

Query: 261 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 320 

P4-N + -f L+KE DG VF4 NGV+ + IP 

Sbjct: 358 PSNPID- -EKLVKEAVRKVG DGYVFEK NGVS RYIPAKN 393 

Query: 321 LSA LEE KI ARRVP I S GTGS TVS TNAKP NEWXXXXXXXXXXXXXXXXKE 3 69 

LSA +4 K+A++ +S T+ N+ + 

Sbjct: 3 94 LSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQV 453 

Query: 3 70 LSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPIN 42 9 

AD + KD4- 4 D 44- 4 PN + + + 

Sbjct: 454 DFEALDNLLERLKDVPSDKVKLV DDILAFLAPIRHPERLGKPNAQITYTDDEIQVA 509 

Query: 43 0 PGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVK 489 

4+ + EDGY FD I 4-++ +4 H H+H4 K L4-E 4- AAQ 4- 4E 
Sbjct: 510 K - LAGKYTTEDG Y I FD PRD I TSDEGDAYVTPHMTHSHW I KKDS LS EAERAAAQAYAKE - - 566 

Query: 490 TSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIAGIMKQYGVKRESIWNKEKNA 545 

GL S+ QD GN AK 4- 4- 44-4 +K+ Y 4+ V 4- + 

Sbjct: 567 KGLTPPSTDHQD - SGNTEAKGAEAI YNRVKAAKKVPLDRMPYNLQ YTVEVKNGS 619 

Query: 546 IIYPHGDHHH 555 

4-1 PH DH+H 
Sbjct: 620 LIIPHYDHYH 629 

Score = 67.4 bits (163), Expect = le-09 

Identities = 49/190 (25%), Positives = 79/190 (40%), Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARS QGRYTTNDG YVF 188 

LKD 4D V4 D + I R 4 K N 44 4 4 VA4 G4YTT DGY4F 

Sbjct: 465 LKDVP - SDKV2CLVDD ILAFLAP IRHPERLGKPNAQ IT YTDDE I QVAKLAGKYTTEDG Y I F 523 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

4P DI D G+AY4 PH H H+I + p ST 4 

Sbjct: 524 DPRD I TSDEGDAYVTPHMTHSHW I KKDS LSEAERAAAQAYAKEKGLTP PSTDHQD 578 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 
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+ + AKG+ 
Sbjct: 579 SGNTEAKGA- 



+ +Y+ A + D + + + + + IP 

-EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIP 62 3 



Query: 3 09 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 624 HYDHYHNIKF 633 



Score = 53.5 bits (127), Expect = 2e-05 

Identities = 58/302 (19%), Positives = 109/302 (35%), Gaps = 84/302 (27%) 

Query: 297 1 1 SRTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEVVX 352 

+ 1 T G HGDHYH+ +PY + + 
Sbjct: 52 VI KI TDQGYVTSHGDHYHY YNGKVPYDAI I S 82 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 406 

+EL Y DIV E Y+ + + G ++ Y+ + 

Sbjct: 83 EELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADN 127 

Query: 4 07 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 458 

Q + + +NS A + + G +DGY F+A+ II + +1 

Sbjct: 12 8 IRTKEEIKRQKQERSHNHNSRADNAVAAARAQG R YTTDDG Y I FNASD 1 1 EDTGDAY I 184 

Query: 459 MSHGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSHNGLDSLSSHEQDY 505 

4- HGNH HY K DL+ ++ AAQ + +SHN + + + + 

Sbjct: 185 VPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTP 244 

Query: 506 PGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKE KNAI I YPHGDHHHAD P I 559 

+ + +4-4. + EA4-+V+ + + +++ + PHG-fH+H P 

Sbjct: 245 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPY 304 

Query: 560 DE 561 
+ 

Sbjct: 305 SQ 306 



Score - 43.1 bits (100), Expect = 0.027 

Identities = 44/185 (23%), Positives = 64/185 (33%), Gaps - 36/185 (19%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI + AY+ H H H+I K + Q L PS 

Sbjct: 516 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 575 

Query: 426 LPINPGISHEKHEE DGYGFDANRIIAEDESGFIMSHGNHNH 466 

+ G + K E D++ + I+H+HH 

Sbjct: 576 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 634 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y KTE + AK HE S HG + S H + + DD 

Sbjct: 635 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDKTGFGNASDHVRKNKADQDSKPDED 694 

Query: 517 KKIEE 521 

K+ +E 

Sbjct: 695 KEHDE 699 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 
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Query; 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP I S + PH H H+I LS E A+ 

Sbjct: 505 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 564 

Query: 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 385 

P + + +T AK E + D +N + V 

Sbjct: 565 KEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAK KVPLDRMPYNLQYTV 613 

Query: 3 86 E E TAT AY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 

Sbjct: 614 EVKNGSLIIPHYDHYHNI 631 



tr Q 6T304 Surface protein BVH-11 (Fragment) [bvhll] 807 AA 

[Streptococcus align 
pneumoniae] 



Score = 410 bits (1054) , Expect = e-113 

Identities = 204/337 (60%), Positives = 255/337 (75%), Gaps = 10/337 (2%) 

Query: 21 C A Y ALNQHR S - QENKDNNR VS Y VD G S Q S S Q KS ENL T PDQ VS Q KE G I Q AE QIVIKITDQGY 79 

C+Y L +H++ Q K++NRVSY+DG Q+ QK+ENLTPD+VS++EGI AEQIVIKITDQGY 
Sbjct : 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

Query : 80 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLK 13 9 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKVDGKYYVYLK 
Sbjct : 61 VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLK 12 0 

Query: 14 0 DAAHADNVRTKDEINRQKQEHVKD-NEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTG 198 

DAAHADN+RTK+EI RQKQE + N + ++ VA AR+QGRYTT+DGY+FN +DIIEDTG 
Sbjct: 121 DAAHADNIRTKEE I KRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYI FNASD I I EDTG 180 

Query: 199 NAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ- - -SVAK 255 

+AYIVPHG H+HYIP Q S + S SS+ + N Q S 

Sbjct: 181 DAYIVPHGNHFHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENH 238 

Query: 256 GSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDH 312 

TP EN+ SLL+ELY P +4-R+ ESDGL-f FDPA-f I SRT GVA+PHG+H 

Sbjct: 23 9 NLTVTPTYHQNQGENI S SLLREL YAKPLS ERHVESDGL I FDPAQI TSRTARGVAVPHGNH 2 98 

Query: 313 YHF I P YS KLS ALEEKI ARRVP I SGTGSTVS TNAKPNE 349 

YHFIPYS++S LEE+IAR +P+ + +++P + 

Sbjct: 299 YHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQ 335 



Score =76.6 bits (187), Expect = 2e-12 

Identities = 84/385 (21%), Positives = 133/385 (33%), Gaps = 73/385 (18%) 

Query: 193 I IEDTGNAYIVPHGGHYHYI PXXXXXXXXXXXXXXXXXXXNM 234 

+1+ T Y+ HG HYHY P + 

Sbjct: 52 VI KI TDQG YVTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSD I VNE I KGG YVI KV 111 

Query: 2 35 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQ-RYSESDGLVFD 2 93 

Y A+ + + + +S N S + AQ RY+ DG +F+ 

Sbjct: 112 DGKYYVYLKDAAHADNIRTKEEIKRQKQERSHNHNSRADNAVAAARAQGRYTTDDGYIFN 171 
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Query: 294 PAKI I SRTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGSTVS T NAKP-- 347 

+ II T +■ 4-PHG+H+H+IP S LSA E A+ GS S+ NA P 

Sbjct: 172 ASDIIEDTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQ 231 

Query: 348 NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNPKDIVEETATAY 392 

N V +EL + SDG IF+P I TA 

Sbjct: 232 PRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGV 291 

Query: 3 93 IVRHGDHFHYIPKS NQIGQPTLPNNSLATPSPSLP 42 7 

V HG+H+H+IP S 4-4- QP+ + +PS p 

Sbjct: 292 AVPHGNHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQSA 351 

Query: 428 INPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHL-E 486 

WP + ++ 4-4- + G+ + + Y K+L+ E L + 

Sbjct: 352 PNPQPAPSNPIDEKL VKEWRKVGDGYVFEKNGVSRYIPAKNLSAETAAGIDSKLAK 408 

Query: 4 87 EVKTSHNGLDSLSSHEQDYPGNAKE 511 

+ SH L + 4- D P 4- 4-E 

Sbjct: 409 QESLSH KLGAKKTDLPSSDRE 429 



Score = 75.1 bits (183), Expect = 7e-12 

Identities = 108/546 (19%), Positives = 207/546 (37%), Gaps =71/546 (13%) 

Query: 2 9 RSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEG- IQAEQIVIKITDQGYVTSHGDHY 87 

R ++ + +N S D + + 4- 4-4-4- + + +G I 4-1+ T Y+ HG4-H+ 

Sbjct: 136 RQKQERSHNHNSRADNAVAAARAQG RYTTDDG Y I FNASD I I EDTGDAY I VPHGNHF 191 

Query: 88 HYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLKDAAHADNV 147 

HY +P L + EL + K + + + N+ 

Sbjct: 192 HY 1 PKSDLSASELAAAQAYWNGKQGSRPS S S S SHNANPAQPRL SENHNL 240 

Query: 148 RTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGG 2 07 

+ 4- + E+4- 4-4- A S+ R+ 4-DG +F+PA I T VPHG 
Sbjct: 241 TVTPTYHQNQGENI - -SSLLRELYAKPLSE-RHVESDGLIFDPAQITSRTARGVAVPHGN 297 

Query: 2 08 HYHYIPXXXXXXXXXXXXXXXXXXX NMQPSQLSYSSTASDNNTQSVAKGSTSK 2 60 

HYH+IP + +P Q S ST 4-+ A 

Sbjct: 298 HYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQSAPNPQPA 357 

Query: 261 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I PYSK 320 

P+N 4- + L + KE+ DG VF4- NGV+ +IP 
Sbjct: 358 PSNPID- -EKLVKEWRKVG DGYVFEK NGVS RYIPAKN 3 93 

Query: 321 LSA LEE K I ARRVP I S GTG S TVS TN - - AKPNE WXXXXXXXXXXXXXXXX KE L S S A 373 

LSA 4-+ K4-A++ 4-S T+ + E 4- 

Sbjct: 3 94 LSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQV 453 

Query: 374 SDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGIS 433 

4- N 4- +E+ 4-4- D 4-4- 4- PN 4- + 4- 4-4- 

Sbjct: 454 DFEALDNLLERLEDVPSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAK-LA 512 

Query: 434 HEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHN 493 

4- EDGY FD I 4-4-+ 4-+ H H4-H+ K L+E 4- AAQ + +E 
Sbjct: 513 GKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKE K 567 

Query: 494 GLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIAGIMKQYGVKRESIWNKEKNAIIYP 549 
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GL S+ QD GN AK + + +++ +K+ Y ++ V + + + I P 

Sbjct: 568 GLTPPSTDHQD - SGMTEAKGAEAI YNRVKAAKKVPLDRMPYNLQ YTVEVKNGSLI IP 623 

Query: 55 0 HGDHHH 555 

H DH+H 
Sbjct: 624 HYDHYH 62 9 

Score = 66.6 bits (161), Expect = 2e-09 

Identities = 46/184 (25%) , Positives = 76/184 (41%) , Gaps = 29/184 (15%) 

Query: 144 ADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVFNPADI I 194 

+D V+ D+I R + K M ++ + + VA+ G+YTT DGY+F+P DI 

Sbjct: 470 S DKVKLVDD I LAFLAP I RHPERLGKPNAQ I TYTDDE I QVAKLAGKYTTEDGY I FD PRD IT 52 9 

Query: 195 EDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVA 254 

D G+AY+ PH H H+I + P ST ++ + A 

Sbjct: 530 S D EGD A YVT PHMTHS H W I KKD S L S E AE RAAAQ A YAKE KGL T P PSTDHQDSGNTEA 584 

Query: 255 KGSTSKPANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYH 314 

KG+ + +Y+ A + D+++ + +IPH DHYH 

Sbjct: 585 KGA EAI YNRVKAAKKVPLDRMP YNLQ YTVE VICNGS L 1 1 PHYDHYH 62 9 

Query: 315 FIPY 318 

I + 

Sbjct: 630 NIKF 633 



Score = 53.5 bits (127), Expect = 2e-05 

Identities = 58/302 (19%), Positives = 109/302 (35%), Gaps = 84/302 (27%) 

Query: 297 I I SRTPNGVAI PHGDHYH F IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWX 352 

+ 1 T G HGDHYH+ +PY + + 
Sbjct: 52 VI KI TDQGYVTSHGDH YHYYNGKVP YDAI I S 82 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKS 406 

+EL Y DIV E Y+++ G ++ Y+ + 

Sbjct: 83 EELLMKDPNYQLKDSDIVNEIKGGYVIKVDGKYYVYLKDAAHADN 127 

Query: 407 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFI 458 

Q + + +NS A + + G +DGY F+A+ II + +1 

Sbjct: 128 IRTKEE I KRQKQERSHNHNSRADNAVAAARAQG R YTTDDGYI FNASD I I EDTGDAYI 184 

Query: 459 MSHGNHNHYFFKKDLTEEQ I KAAQKHLE EVKTSHNGLDSLSSHEQDY 505 

+ HGNH HY K DL+ AAQ + +SHN + +++ 

Sbjct: 185 VPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTP 244 

Query: 506 PGNAKEMKDLDKKIEEKIAGIMKQYGVKRES I WNKEKNAIIYPHGDHHHADPI 559 

+ + + + + 4-EA++V+ + + +++ + PHG+H+H P 

Sbjct: 245 TYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPY 304 

Query: 560 DE 561 
Sbjct: 305 SQ 306 

Score = 44.7 bits (104), Expect = 0.009 

Identities = 45/186 (24%), Positives = 63/186 (33%), Gaps = 36/186 (19%) 
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Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 42 5 

+ DGYIF+P+DI + AY 4- H H H+I K + Q L PS 

Sbjct: 516 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 575 

Query: 426 LPINPGISHEKHEE DGYGFDANRI IAEDESGFIMSHGNHNH 466 

+ G + K E D + + + 1+ H +H H 

Sbjct: 576 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 634 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K T E + A K HE S NG + S H Q + + 

Sbjct: 635 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTE 694 

Query: 517 KKIEEK 522 

K EEK 
Sbjct: 695 KPSEEK 700 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E + + A +Y+ DG +FDP IS + PH H H+I LS E A+ 

Sbjct: 505 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 564 

Query : 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 385 

P + + +T AK E + D +N + V 

Sbjct: 565 KE KGLTPP S TDHQD S GNTE AKGAE AI YNRVKAAK KVPLDRMPYNLQYTV 613 

Query: 3 86 E E TAT AY I VRHGDH FH Y I 4 03 

E + 1+ H DH+H I 
Sbjct: 614 E VKNG S L 1 1 PH YDH YHN I 631 



tr Q6WNQ0 Surface protein BVH-11 (Fragment) [bvh-11] [Streptococcus 811 AA 
pneumoniae] 

align 

Score = 405 bits (1040) , Expect = e-111 

Identities = 203/339 (59%), Positives = 253/339 (73%), Gaps = 13/339 (3%) 

Query: 21 CAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGY 7 9 

C+Y L +H++ Q+ K++NRV+Y+DG Q+ QK+ENLTPD+VS++EGI AEQIVIKITDQGY 
Sbjct : 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

Query : 80 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLK 139 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKV+GKYYVYLK 
Sbj ct : 61 VTSHGDH YHYYNGKVP YDAI I SEELLMKDPNYQLKDSD I VNE I KGG YVI KVNGKYYVYLK 120 

Query: 140 DAAHADNVRTKDE INRQKQEHVKDMEKVNSN VAVARSQGR YTTNDGYVFNPAD I I ED 196 

DAAHADN+ RTK+ E I RQKQEH N SN V AR+QGRYTT+DGY+FN +DIIED 
Sbjct: 121 DAAHADNIRTKEEIKRQKQEH-SHNHGGGSNDQAWAARAQGRYTTDDGYIFNASDIIED 179 

Query: 197 TGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ- - -SV 253 

TG+AYIVPHG H+HYIP Q S+ S SS+ + N Q S 

Sbjct: 180 TGDAYIVPHGMHFHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSE 237 
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Query: 254 AKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHG 310 

TP N+ EN+ SLL+ELY P 44-R4- ESDGL+FDPA+I SRT GVA+PHG 
Sbjct: 238 NHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHG 297 

Query: 311 DHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 349 

+HYHFIPY 44S LEE+IAR 4-P4 4 +++P + 

Sbjct: 298 NHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQ 336 



Score =73.9 bits (180), Expect = le-11 

Identities = 71/272 (26%), Positives = 106/272 (38%), Gaps = 50/272 (18%) 

Query: 2 83 RYSESDGL VFDPAKI I SRTPNGVAI PHGDHYHF I P YS KLS ALEEKI ARRVP I SGTGS TVS 342 

RY+ DG +F+ +11 T + +PHG+H+H+IP S LSA E A+ GS S 

Sbjct: 162 RYTTDDGYIFNASDIIEDTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPS 221 

Query: 343 T NAKP NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNP 381 

+ NA P N V 4EL + SDG IF+P 

Sbjct: 222 S S S SHNANPAQPRLSENHNLTVTPT YHQNQGENI S SLLREL YAKPLS ERHVESDGL I FDP 281 

Query: 3 82 KD I VE E TAT AY I VRHGDH FH Y I P KSNQIGQPTLPNNSLATP 422 

I TA V HG+H+H+IP +SN + P P 

Sbjct: 282 AQITSRTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQP 341 

Query: 423 SPS - -LPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

SPS NP + +4- +A R + 4- G44- + Y KDL4 E 

Sbjct: 342 S PS PQPAPNPQPAPSNP IDEKLVKEAVRKVGD GYVFEENGVSRYIPAKDLSAETAAG 398 

Query: 4 81 AQKHL - EE VKTSHNGLDS LS SHEQD YPGNAKE 511 

L 4-4 SH L 4 4 D P 4 +E 

Sbjct: 399 IDSKLAKQESLSH KLGTKKTDLPSSDRE 426 



Score = 70.1 bits (170), Expect = 2e-10 

Identities = 105/511 (20%) , Positives = 192/511 (37%) , Gaps = 86/511 (16%) 

Query: 71 VIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKV 13 0 

+1+ T Y+ HG+H+HY +P L + EL + K + 

Sbjct: 176 I IEDTGDAYIVPHGNHFHY IPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPA 231 

Query: 131 DGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNP 190 

+ 4- N+ 44 4 E44 4-4 A S+ R+ +DG 4-F4P 

Sbjct: 232 QPRL SENHNLTVTPTYHQNQGENI - -SSLLRELYAKPLSE -RHVESDGLIFDP 281 

Query: 191 ADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXX XXXXNMQPSQLSYSSTASD 247 

A I T VPHG HYH4IP 4 P + 

Sbjct: 282 AQITSRTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQP 341 

Query: 24 8 NNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKI I SRTPNGVAI 3 07 

4 4 A P4N" + 4- L4KE DG VF4 NGV+ 
Sbjct: 342 SPSPQPAPNPQPAPSNPID- -EKLVKEAVRKVG DGYVFEE . NGVS- 383 

Query: 308 PHGDHYHF I P YS KL S A LEEKIARRVPISGTGSTVSTNAKP NEWXXXXX 356 

+ IP LSA 4-4 K4-A4-4 +S T T+ N+ 

Sbjct: 384 R Y I PAKDL S AETAAG IDS KLAKQE S L SHKLGTKKTDL P S S DRE F YNKAYDLLAR 437 

Query: 357 XXXXXXXXXXXKELSSASDGYIFNPKDI VEETATAYI - -VRHGDHFHYIPKSNQ 408 
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+ AD + KD4- + E A+ + +RH + + 
Sbjct: 438 IHQDLLDNKGRQVDFEALDNLLERLKDVSSDKVKLVEDILAFLAPIRHPE R 488 

Query: 409 IGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYF 468 

+G+P N+ + + + + 4- + EDGY FD I + + + ++ H H+H+ 

Sbjct: 489 LGKP NS Q I T YTDDE IQVAK- LAGKYTTEDGY I FDPRD I TSDEGDAYVTPHMTHSHWI 544 

Query: 469 FKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGN- -AKEMKDLDKKIE- -EKIA 524 

K L+E + AAQ + +E GL S+ QD GM AK + + +++ +K+ 

Sbjct: 545 KKD S L S E AERAAAQ A Y AKE KGLTPPSTDHQD - SGNTEAKGAEAI YNRVKAAKKVP 598 

Query: 525 GIMKQYGVKRES IWNKEKNAI I YPHGDHHH 555 

Y ++- V + + + I PH DH+H 

Sbjct: 599 LDRMPYNLQ YTVEVKNGSL I I PHYDHYH 626 



Score = 67.0 bits (162), Expect = 2e-09 

Identities = 48/190 (25%), Positives = 80/190 (41%), Gaps - 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 18 8 

LKD + +D V+ + + I R -f K N ++ + + VA+ G+YTT DGY+F 

Sbjct: 462 LKDVS -SDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIF 520 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+P DI D G+AY+ PH H H+I + P ST + 

Sbjct: 521 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 575 

Query: 24 9 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 3 08 

■f + AKG-f + +Y+ A + D + +-f + + IP 

Sbjct: 576 SGNTEAKGA EAI YNRVKAAKKVPLDRMPYNLQ YTVEVKNGSL I 1 P 620 

Query: 30 9 HGDHYHFIPY 318 

H DHYH I + 
Sbjct: 621 HYDHYHNIKF 63 0 

Score =53.9 bits (128), Expect = 2e-05 

Identities = 55/300 (18%), Positives = 107/300 (35%), Gaps = 79/300 (26%) 

Query: 297 I I SRTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWX 352 

+ 1 T G HGDHYH+ -f-PY + + 
Sbjct: 52 VI KI TDQG YVTSHGDHYHYYNGKVP YDAI I S 82 

Query: 353 XXXXXXXXXXXXXXX KE L S S AS DG Y I FNP KD I VEE TAT AY I VR - HGDH FH Y I P KS NQ I GQ 411 

+EL Y DIV E Y+++ +G ++ Y+ + 

Sbjct: 83 EELLMKDPNYQLKDSD I VNE I KGGYVI KVNGKYYVYLKDAAHADN 127 

Query: 412 PTLPNNSLATPSPSLPINPGISHEKH EEDGYGFDANRI IAEDESGFIMS 460 

+ G S + + + +DGY F+A+ II + +1+ 

Sbjct: 128 IRTKEE I KRQKQEHSHNHGGGSNDQAWAARAQGRYTTDDGYI FNASD I I EDTGDAYI VP 187 

Query: 461 HGNHNHYFFKKDLTEEQI KAAQKHLE EVKTSHNGLDSLSSHEQDY PG 507 

HGNH HY K DL+ ++ AAQ + +SHN + + + + 

Sbjct: 188 HGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTPTY 247 

Query: 508 NAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAIIYPHGDHHHADPIDE 561 

+ + + + + +EA++V+++ + + + + PHG+H+H P ++ 

Sbjct: 248 HQNQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQ 307 
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Score =43.9 bits (102), Expect = 0.016 

Identities = 62/286 (21%) , Positives = 95/286 (32%) , Gaps = 43/286 (15%) 

Query: 372 S ASDGYI FNPKDI VEETATAYI VRHGDHFHYI PKSN QIGQPTLPNNSLATPSPS 425 

+ DGYIF+P+DI + AY + H H H+I K + Q L PS 

Sbjct: 513 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 572 

Query: 426 LPINPGISHEKHEE DGYGFDANRIIAEDESGFIMSHGNHNH 466 

4-G+KE D + + + 1+ H +H H 

Sbjct: 573 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 631 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K T E + A K HE S NG + S H + + DD 

Sbjct: 632 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNKADQDSKPDED 691 

Query: 517 KKIEEKIAGIMKQYGVKRES IWNKEKNAI IYPHGDHHHA DPIDEHKPVGIGHSH 571 

K +E + K +N + + P D D DE + + HS 

Sbjct: 692 KGHDEVSEPTHPESDEKENHAGLNPS ADNLYKPSTDTEETEEEAEDTTDEAE I PQVEHSV 751 

Query: 572 SNYELFKPEEGVAKKEGNKVYTG- -EELTNWNLLKNSTFNNQNFT 615 

N + + E + K + E LT + + L T +N + 

Sbjct: 752 INAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTIS 797 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%) , Positives = 51/138 (36%) , Gaps = 19/138 (13%) 

Query: 274 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 332 

E+ + A +Y+ DG +FDP IS 4- PH H H+I LS E A+ 

Sbjct: 502 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 561 

Query: 3 33 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + D +N + V 

Sbjct: 562 KE KGLTP P S TDHQD S GNTE AKGAE A I YNR VKAAK KVPLDRMPYNLQYTV 610 

Query: 3 86 EETATAY I VRHGDHFHYI 4 03 

E + 1+ H DH+H I 
Sbjct: 611 E VKNGS L 1 1 PH YDH YHN I 628 



tr Q6WNP6 Surface protein BVH-11-2 (Fragment) [bvh-11-2] [Streptococcus 816 AA 
pneumoniae] 

align 

Score = 400 bits (1028) , Expect = e-110 

Identities = 202/339 (59%), Positives = 251/339 (73%), Gaps = 13/339 (3%) 

Query: 21 CAYALNQHRS -QENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGY 7 9 

C+Y L +H++ Q+ K++NRV+Y+DG Q+ QK+ENLTPD+VS++EGI AEQIVIKITDQGY 
Sbjct : 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

Query: 8 0 VTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIWEVKGGYIIKVDGKYYVYLK 13 9 

VTSHGDHYHYYNGKVPYDA+ SEELLMKDPNYQLKD+DIVNE+KGGY+IKV+GKYYVYLK 
Sbjct: 61 VTSHGDH YHYYNGKVPYDAI I S EELLMKDPNYQLKDSD I VNE I KGG YVI KVNGKY YVYLK 120 
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Query: 14 0 DAAHADNVRTKDE INRQKQEHVKDNEKVNSN VAVARS QGRYTTNDG YVFNP AD 1 1 ED 196 

DAAHADN+RTK+EI RQKQEH N SN V AR+QGRYTT+DGY+FN +DIIED 
Sbjct: 121 DAAHADNIRTKEEIKRQKQEH-SHNHGGGSNDQAWAARAQGRYTTDDGYIFNASDIIED 179 

Query : 197 TGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQ SV 253 

TG+AYIVP G H+HYIP Q S+ S SS+ + N Q S 

Sbjct: 180 TGDAYIVPRGNHFHYIP- -KSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSE 237 

Query: 254 AKGSTSKPA NKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHG 310 

TP N+ EN+ SLL+ELY P ++R ESDGL+FDPA+I SRT GVA+PHG 
Sbjct: 238 NHNLTVTPTYHQNQGENISSLLRELYAKPLSERRVESDGLIFDPAQITSRTARGVAVPHG 297 

Query: 311 DHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNE 349 

+HYHFIPY ++S LEE+IAR +P+ + + + 4-P + 

Sbjct: 298 NHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQ 336 



Score = 70.9 bits (172), Expect = le-10 

Identities = 70/272 (25%), Positives = 105/272 (37%), Gaps = 50/272 (18%) 

Query: 283 RYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVS 342 

RY+ DG +F+ + II T + +P G+H+H+IP S LSA E A+ GS S 

Sbjct: 162 RYTTDDGYIFNASDIIEDTGDAYIVPRGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPS 221 

Query: 343 T NAKP -NEWXXXXXXXXXXXXXXXXKELSSA SDGYIFNP 3 81 

+ NA P N V +EL + SDG IF+P 

Sbjct: 222 SSSSHNANPAQPRLSEMHNLTVTPTYHQNQGENISSLLRELYAKPLSERRVESDGLIFDP 281 

Query: 382 KDIVEETATAYIVRHGDHFHYIP KSNQIGQPTLPNNSLATP 422 

I TA V HG-fH-fH+IP -fSN + P P 

Sbjct: 282 AQITSRTARGVAVPHGNHYHFIPYEQMSELEERIARIIPLRYRSNHWVPDSRPEQPSPQP 341 

Query: 423 SPS- -LPINPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKA 480 

SPS NP + ++ +A R + + G++ + Y KDL+ E 

Sbjct: 342 SPSPQPAPNPQPAPSNPIDEKLVKEAVRKVGD GYVFEENGVSRYIPAKDLSAETAAG 398 

Query: 481 AQKHL-EEVKTSHNGLDSLSSHEQDYPGNAKE 511 

L SH L + + D P + +E 

Sbjct: 399 IDSKLAKQESLSH KLGTKKTDLPSSDRE 426 



Score = 67.0 bits (162), Expect = 2e-09 

Identities = 48/190 (25%) , Positives = 80/190 (41%) , Gaps = 30/190 (15%) 

Query: 13 8 LKDAAHADNVRTKDE IN RQKQEHVKDNEKV NSNVAVARSQGRYTTNDGYVF 18 8 

LKD + +D V+ + + I R + K N ++ + + VA+ G+YTT DGY+F 

Sbjct: 462 LKDVS-SDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIQVAKLAGKYTTEDGYIF 520 

Query: 189 NPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDN 248 

+ P DI D G+AY-h PH H H+I + P ST + 

Sbjct: 521 DPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTP PSTDHQD 575 

Query: 249 NTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIP 308 

+ + AKG-f + +Y+ A + D + ++ + + IP 

Sbjct: 576 SGNTEAKGA EAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIP 620 

Query: 309 HGDHYHFIPY 318 



http://us.expasy.org/cgi-bin/blast.pl 



9/27/04 



ExPASy BLAST2 Interface 



Page 61 of 116 



H DHYH I + 
Sbjct: 621 HYDHYHNIKF 630 



Score = 50.8 bits (120), Expect = le-04 

Identities = 54/300 (18%), Positives = 106/300 (35%), Gaps = 79/300 (26%) 

Query: 297 IISRTPNGVAIPHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWX 352 

+ 1 T G HGDHYH+ + PY + + 
Sbjct: 52 VI KITDQGYVTSHGDHYHYYNGKVPYDAI I S 82 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIPKSNQIGQ 411 

+EL Y DIV E Y++ + +G + 4- Y+ + 

Sbjct: 83 EELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADN 127 

Query: 412 PTLPNNSLATPS PSLP INPGI SHEKH EEDGYGFDANRI IAEDESGFIMS 460 

+ G S+++ +DGY F+A+ II + +1+ 

Sbjct: 128 IRTKEEIKRQKQEHSHNHGGGSNDQAWAARAQGRYTTDDGYIFNASDIIEDTGDAYIVP 187 

Query: 461 HGNHNHYFFKKDLTEEQIKAAQKHLE EVKTSHNGLDS LS SHEQD Y PG 507 

GNH HY K DL+ ++ AAQ + -fSHN + + + + 

Sbjct: 188 RGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSENHNLTVTPTY 247 

Query: 50 8 NAKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI I YPHGDHHHADPIDE 561 

+ + + + + + EA+~f-V+4-+ + ++ + PHG+H+H P + + 

Sbjct: 248 HQNQGENISSLLRELYAKPLSERRVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQ 307 



Score = 43.9 bits (102), Expect = 0.016 

Identities = 62/286 (21%), Positives = 95/286 (32%), Gaps = 43/286 (15%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN QIGQPTLPNNSLATPSPS 42 5 

+ DGYIF+P+DI + AY+ H H H+I K + Q L PS 

Sbjct: 513 TTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTD 572 

Query: 426 LPINPGISHEKHEE DGYGFDANR 1 1 AEDESGF IMSHGNHNH 466 

+ G+ KE + I+H+HH 

Sbjct: 573 HQ-DSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFE 631 

Query: 467 YFFKKDLTEEQIKAAQK HLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLD 516 

Y K T E + A K HE SNG+SH+ + DD 

Sbjct: 632 WFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNKADQDSKPDED 691 

Query: 517 KKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHA DPIDEHKPVGIGHSH 571 

K +E + K +N + + P D D DE + + HS 

Sbjct: 692 KGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEIPQVEHSV 751 

Query: 572 SNYELFKPEEGVAKKEGNKVYTG- -EELTNWNLLKNSTFNNQNFT 615 

N ++ E ■ + K + E LT + + L T +N + 

Sbjct: 752 INAKIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNNTIS 797 



Score = 39.7 bits (91), Expect = 0.30 

Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 19/138 (13%) 

Query: 2 74 ELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRV- 3 32 

E + + A +Y+ DG +FDP IS + PH H H+I LS E A+ 

Sbjct: 502 EIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYA 561 
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Query; 333 PISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIV 3 85 

P + + +T AK E + D 4-N + V 

Sbjct: 562 KE KGLTP PS TDHQDS GNTE AKGAE A I YNR VKAAK KVPLDRMPYNLQYTV 610 

Query: 386 EETATAYIVRHGDHFHYI 4 03 

E + 1+ H DH+H I 

Sbjct: 611 EVKNGSLI IPHYDHYHNI 62 8 



tr Q8NZ82 Hypothetical protein spyM18_2072 [spyM18_2072] [Streptococcus 823 AA 
pyogenes (serotype M18)] 

align 



Score = 306 bits (785) , Expect = le-81 

Identities = 198/612 (32%), Positives = 299/612 (48%), Gaps = 74/612 (12%) 

Query: 1 MKFS KKY IAAGS AVI VSLSLCAYALNQHRSQENKDNNRVS YVDGS QS S QKS E - - NLTPDQ 58 

MK + YI + +A+4-4-4- + +Y L +H +N+++Y+D S+ K+ N T DQ 

Sbjct : 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGPVTKDNQIAYIDDSKGKAKAPKTNKTMDQ 60 

Query : 59 VSQKEG I QAEQ I VI KI TDQGYVTSHGDHYHYYNGKVP YDALFS EELLMKDPNYQLKDAD I 118 

+S +EGI AEQ I V+ KI TDQG YVTSHGDH YH+ YNGKVP YDA+ SEELLM DPNY+ K +D + 
Sbjct : 61 I S AEEG I S AEQ I WKI TDQGYVTSHGDHYHF YNGKVP YDAI I SEELLMTDPNYRFKQSDV 120 

Query: 119 VNE VKGGYI I KVTDGKY YVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVA 172 

+NE+ GY+IKV+G YYVYLK + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 INE I LDGYVI KVNGNY YVYLKPGS KRKNI RTKQQI AEQVAKGTKEAKEKGLAQVAHLS KE 180 

Query : 173 VARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXX 226 

A+ QGRYTT+DGY+F+P DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 EVAAVNEAKRQGRYTTDDGY I FS PTD I IDDLGDAYLVPHGNHYH YI PKKDLS PSELAAAQ 240 

Query: 227 XXXXXXNMQPSQLS-YSSTASDNNTQSVAKGSTSKPAN 263 

+++SYT+ ++ TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPNPRQGHQPDNGGYHPAPPRPNDASQN 3 00 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFI 316 

K + + LL +L+ R+ E DGL + F+P + + I G +PHG.DHYH I 

Sbjct: 301 KHQRDE F KGKTFKELLDQLHRLDLKYRHVE EDGL I FE PTQVI KSNAFGY WPHGDH YH 1 1 360 

Query: 317 PYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKELSS 372 

P S+LS LE ++A R T + S ++KP+ EV + 

Sbjct: 361 PRSQLSPLEMELADRYLAGQTEDNDSGSDHSKPSDKEVTHTFLGHRIKAYGKGLDGKPYD 420 

Query: 373 AS DG Y I FNP KD I VEE T ATAY I VRHGDHFH Y I PKSNQ I GQ PTLPN - - NS L AT P S PS L P I NP 430 

SD Y+F+ 4-1 + +HGDHFHYI Q L N + + + 

Sbjct: 421 TSDAYVFSKESIHSVDKSGVTAKHGDHFHYI -GFGELEQYELDEVANWVKAKGQADELAA 479 

Query: 431 GISHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLE 486 

+ +E FD + ++ + -f G+IM + + + 4- + L QI A++ L 

Sbjct: 480 ALDQEQGKEKPL - FDTKKVSRKVTKDGKVGYIMPKDGKDYFYARDQLDLTQIAFAEQELM 538 

Query: 4 87 EVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAI 54 6 

H D +D IE ++A + + + + + + 

Sbjct: 539 LKDKKHYRYD I VDTG I E PRLAVD VS S L PMHAGNAT YDTGS S F V 581 
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Query: 547 IYPHGDHHHADP 558 

I PH DH H P 
Sbjct: 582 I-PHIDHIHWP 592 

Score = 51.2 bits (121), Expect = le-04 

Identities = 54/313 (17%), Positives = 109/313 (34%), Gaps = 35/313 (11%) 

Query: 67 AEQIVIKITDQGYVTSHGDHYHY- -YNGKVPYD ALFSEELLM 106 

+ + + + + g HGDH+HY + Y+ A 4-E 

Sbjct: 428 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 487 

Query: 107 KDPNYQLKDAD- - IWEVKGGYIIKVDGKYYWLKDAAHADNVRTKX)EINRQKQEHVKDN 164 

+ P + K + + K GYI4- DGK Y Y +D + ++ + +KD 

Sbjct: 488 EKPLFDTKKVSRKVTKDGKVGYIMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 542 

Query: 165 EKVNSNVAVARSQGRYTTN-DGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXX 223 

4- +4- + R + + + DTG+ + 4- ++PH H H +P 

Sbjct: 543 KHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 602 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 2 83 

+ + P S + + + -f +E+ + + R 

Sbjct: 603 TIKYVMQHPEVRPDVWSKPGHEESGSVIPNVTPLDKRAGMPNWQIIHSAEEVQKALAEGR 662 

Query: 284 YSESDGLVFDPAKIISR TPNGVAI PH - - GDHYHF I PYSKLS ALEEKI ARRVP I SG 336 

4-4- DG 4-FDP + + + + + IP G 1 S LS E + A4- 4- 

Sbjct: 663 FATPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 722 

Query: 3 37 TG S TVS TNAKPNE 34 9 

4- KP E 
Sbjct: 72 3 NAGDATDTDKPKE 73 5 

Score =38.1 bits (87), Expect =0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G I + +V+ T Y+ HGDH4-H+ + +P + 

Sbjct: 62 SAEEG- ISAEQI WKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 43 2 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV-- 48 8 

IS E D Y F 4- 4-1 E G+ + + + + + K + 1+ Q+ E+V 

Sbjct: 102 ISEELLMTDPNYRFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAK 161 

Query: 489 KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 545 

4- GL +4- 4-4- E K 4- + 4- + 4-4-4- 4-A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGY I F S PTD I IDDLGDA 214 

Query: 546 I I YPHGDHHHADPIDEHKP 564 

4- PHG+H+H P + P 
Sbjct: 215 YL VPHGNH YH Y I PKKDL S P 233 



tr Q8E4U1 Hypothetical protein gbsl306 [gbsl306] [Streptococcus 822 
agalactiae 
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(serotype III)] align 
Score = 305 bits (782) , Expect = 2e-81 

Identities = 197/612 (32%), Positives = 299/612 (48%), Gaps = 74/612 (12%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE- -NLTPDQ 58 

MK + YI + +A++++ + +Y L +N+++Y+D S+ K+ NT DQ 

Sbjct: 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGLATKDNQIAYIDDSKGKAKAPKTNKTMDQ 60 

Query: 5 9 VSQKEG I QAEQ I VI KI TDQGYVTSHGDHYHY YNGKVP YDALFSEELLMKDPNYQLKDAD I 118 

+S +EGI AEQIV+KITDQGYVTSHGDHYH+ YNGKVP YDA+ SEELLM DPNY+ K +D+ 
Sbjct : 61 ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYRFKQSDV 120 

Query: 119 WEVKGGYI IKVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD -NEKVNSNVA 172 

+NE+ GY+IKV+G YYVYLK + N-fRTK +1 Q + K+ EK + VA 
Sbjct: 121 INE I LDGYVI KVNGNY YVYLKPGS KRKNI RTKQQ I AEQVAKGTKEAKEKGLAQVAHLS KE 18 0 

Query: 173 VARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXX 226 

A+ QGRYTT+DGY+F+P DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 EVAAVNEAKRQGRYTTDDGYIFSPTDIIDDLGDAYLVPHGNHYHYIPKKDLSPSELAAAQ 240 

Query: 227 XXXXXXNMQPSQLS-YSSTASDNNTQSVAKGSTSKPAN 263 

+ ++SYT+ ++ TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPNPGQGHQPDNGGYHPAPPRPNDASQN 300 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFI 316 

K + + LL +L+ R+ E DGL+F+P ++I G +PHGDHYH I 

Sbjct: 301 KHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHYHII 360 

Query: 317 PYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKELSS 3 72 

P S+LS LE ++A R T + S ++KP+ EV + 

Sbjct: 361 PRSQLSPLEMELADRYLAGQTEDNDSGSEHSKPSDKEVTHTFLGHRIKAYGKGLDGKPYD 420 

Query: 373 ASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN- -NSLATPSPSLPINP 430 

SD Y+F+ + I + +HGDHFHYI ++ Q L N + * + 

Sbjct: 421 TSDAYVFSKESIHSVDKSGVTAKHGDHFHYI-GFGELEQYELDEVANWVKAKGQADELAA 479 

Query: 431 GISHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLE 4 86 

+ E+ +E FD 4- + + + + G++M + + + + + L QI A+4- L 

Sbjct: 480 ALDQEQGKEKPL-FDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQIAFAEQELM 538 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAI 546 

H D +D IE +4-A + + + + + + 

Sbjct: 539 LKDKKHYRYDI VDTGIEPRLAVDVSSLPMHAGNATYDTGSSFV 581 

Query: 547 IYPHGDHHHADP 55 8 

I PH DH H P 
Sbjct: 582 I-PHIDHIHWP 592 

Score =49.3 bits (116), Expect = 4e-04 

Identities = 53/313 (16%), Positives = 109/313 (33%), Gaps - 35/313 (11%) 

Query: 67 AEQIVIKI TDQGYVTSHGDHYHY- -YNGKVP YD ALFSEELLM 106 

+++ + + G HGDH+HY + Y+ A +E 

Sbjct: 428 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 487 

Query: 107 KDPNYQLKDAD- - IVIJEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN 164 
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+ p + K + + K GY++ DGK Y Y +D + ++ + +KD 

Sbjct: 488 EKPLFDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 542 

Query: 165 EKVNSNVAVARSQGRYTTN - DG YVFNPAD I I EDTGNAY I VPHGGHYHY I PXXXXXXXXXX 223 

+ ++ + R + + + DTG+++++PH H H +P 

Sbjct: 543 KHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 602 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 283 

+ + p s + + + + + + R 

Sbjct: 603 TVKYVMQHPEVRPDVWSKPGHEESGSVIPNVTPLDKRAGMPNWQIIHSAEEVQKALAEGR 662 



Query: 


284 


YSESDGLVFDPAKIISR TPNGVAIPH- -GDHYHFIPYSKLSALEEKIARRVPISG 


336 




+ 4- DG +FDP 4- + + + +IP G I S LS E + A+ + 




Sbjct : 


663 


FATPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 


722 


Query: 


337 


TGS TVS TNAKPNE 34 9 








+ KP E 




Sbjct : 


723 


NTGD ATDTD KPKE 73 5 




Score 


=38.1 bits (87), Expect =0.88 




Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16? 


0 


Query: 


372 


SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 


431 






SA +G I + +V4- T Y+ HGDH+H+ + +P + 




Sbjct : 


62 


SAEEG-ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 


101 


Query : 


432 


ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV-- 


488 




IS E D Y F + +1 E G+++ + + + K + 1+ E+V 




Sbjct : 


102 


ISEELLMTDPNYRFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAK 


161 


Query: 


489 


KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 


545 






+ GL + + ++ E K + + + + + + + +A 




Sbjct : 


162 


GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDIIDDLGDA 


214 


Query: 


546 


IIYPHGDHHHADPIDEHKP 564 








+ PHG+H+H P + P 




Sbjct : 


215 


YLVPHGNHYHYI PKKDLS P 233 




tr Q8DZ81 Streptococcal histidine triad family protein 


822 






[SAG12 33] 


alig 






[Streptococcus agalactiae (serotype V)] 




Score 




305 bits (782), Expect = 2e-81 




Identities = 198/612 (32%), Positives = 298/612 (48%), Gaps = 74/612 (12%) 


Query: 


1 


MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE - -NLTPDQ 


58 






MK + YI + +A++++ + +Y L +H +N+++Y+D S+ K+ NT DQ 




Sbjct : 


1 


MKKTYGYIGSVAAILLATHIGSYQLGKHHMGLATKDNQIAYIDDSKGKVKAPKTNKTMDQ 


60 


Query: 


59 


VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 


118 






+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY K +D+ 




Sbjct : 


61 


ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYHFKQSDV 


120 



AA 



Query: 119 WEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKD-NEKVNSWA 172 
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+NE+ GY+IKV+G YYVYLK + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 INEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKE 180 

Query: 173 VARS QGR YTTNDG YVFNP AD 1 1 EDTGNAY I VPHGGH YH Y I PXXXXXXXXXXXXX 226 

A+ QGRYTT+DGY+F-fP DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 EVAAVNEAKRQGRYTTDDGYI FS PTD I IDDLGDAYLVPHGNHYH YI PKKDLS PS ELAAAQ 240 

Query: 22 7 XXXXXXNMQPSQLS-YSSTASDNNTQSVAKGSTSKPAN 2 63 

+ ++ S Y T + ++ TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPNPGQGHQPDNGGYHPAPPRPNDASQN 3 00 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFI 316 

K + + LL +L+ R+ E DGL+F+P ++I G +PHGDHYH I 

Sbjct: 301 KHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHYHII 360 

Query: 317 PYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKELSS 372 

P S+LS LE ++A R T + S ++KP+ EV + 

Sbjct: 361 PRSQLSPLEMELADRYLAGQTDDNDSGSDHSKPSDKEVTHTFLGHRIKAYGKGLDGKPYD 420 

Query: 373 ASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN- -NSLATPSPSLPINP 430 

SD Y+F+ +1 + +HGDHFHYI ++ Q L N + + + 

Sbjct: 421 TSDAYVFSKESIHSVDKSGVTAKHGDHFHYI-GFGELEQYELDEVANWVKAKGQADELVA 479 

Query: 431 GISHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLE 4 86 

+ E+ +E FD + + + + + G + IM ++ + + 4- L QI A++ h 

Sbjct: 480 ALDQEQGKEKPL - FDTKKVSRKVTKDGKVGYIMPKDGKDYFYARYQLDLTQIAFAEQELM 538 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAI 546 

H D +D IE ++A + + + + + + 

Sbjct: 539 LKDKKHYRYD I VDTGIEPRLAVDLSSLPMHAGNATYDTGSSFV 581 

Query: 547 IYPHGDHHHADP 55 8 

I PH DH H P 
Sbjct: 582 I-PHIDHIHWP 592 



Score = 47.8 bits (112), Expect = 0.001 

Identities = 53/313 (16%), Positives - 108/313 (33%), Gaps = 35/313 (11%) 

Query: 67 AEQ I VI KI TDQGYVTSHGDHYHY - - YNGKVP YD ALFSEELLM 106 

+ + + + + G HGDH+HY + Y+ A +E 

Sbjct: 428 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELVAALDQEQGK 487 

Query: 107 KDPNYQLKDAD- - IVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN 164 

+ P + K + + K GYI+ DGK Y Y + + ++ + +KD 

Sbjct: 488 EKPLFDTKKVSRKVTKDGKVGYIMPKDGKDYFYARYQLDLTQIAFAEQ ELMLKDK 542 

Query: 165 EKVNSNVAVARSQGRYTTN-DGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXX 223 

+ ++ + R + + + DTG+++++PH H H +P 

Sbjct: 543 KHYRYDIVDTGIEPRLAVDLSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRNQIA 602 

Query: 224 XXXXXXXXXNMQPSQLS YS STASDNNTQS VAKGSTS KPANKSENLQSLLKELYDSPS AQR 2 83 

+ + p s + + + + +E+ + + R 

Sbjct: 603 T I KYVMQHPE VRPDVWS KPGHEESGS VI PNVTPLDKRAGMPNWQ I IHS AEE VQKALAEGR 662 

Query: 284 YSESDGLVFDPAKI ISR TPNGVAI PH - - GDH YHF I PYS KLS ALEEKI ARRVP I SG 336 

++ DG +FDP ++++ +IP G I S LS E + A+ -f 

Sbjct: 663 FAAPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 722 
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Query: 337 TGSTVSTNAKPNE 34 9 

+ KP E 
Sbjct: 72 3 NAGDATDTDKPEE 73 5 

Score = 38.1 bits (87), Expect = 0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G I + +V+ T Y+ HGDH+H+ + +P + 

Sbjct: 62 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+ + 4- + + + K + 1+ Q+ E+V 

Sbjct: 102 I SEELLMTDPNYHFKQSDVINE I LDG YVI KVNGNY YVYLKPGS KRKNIRTKQQ I AEQVAK 161 

Query: 489 - - -KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 54 5 

4- GL ++ +4- E K + + + + + + + +A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 214 

Query: 546 I I YPHGDHHHADPIDEHKP 564 

+ PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHYIPKKDLSP 233 



tr Q9ZHG7 Hypothetical protein [Streptococcus agalactiae] 822 AA 

align 

Score = 305 bits (781) , Expect = 3e-81 

Identities = 198/612 (32%), Positives = 298/612 (48%), Gaps = 74/612 (12%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE- -NLTPDQ 58 

MK + YI + +A++++ + +Y L +H +N+++Y+D S+ K+ N T DQ 

Sbjct : 1 MKKT YG Y I GS VAAI LL ATH I GS YQLGKHHMGLATKDNQ I AY IDD S KGKVKAP KTNKTMDQ 60 

Query : 5 9 VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 118 

+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY K +D+ 
Sbjct : 61 ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYHFKQSDV 120 

Query: 119 WEVKGGYIIKVDGKYYVYLKDAAHADNV^ 172 

+NE+ GY+IKV+G YYVYLK + N+RTK + 1 Q + K+ EK + VA 
Sbjct: 121 INEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKE 180 

Query: 173 VARS QGR YTTNDG YVFNP AD I I EDTGNAY I VPHGGHYH Y I PXXXXXXXXXXXXX 226 

A+ QGRYTT+DGY+F+P DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 EVAAVNE AKRQGRYTTDDGY I FS PTD I IDDLGDAYLVPHGNHYHY I PKKDLS PS ELAAAQ 240 

Query: 227 XXXXXXNMQPSQLS - YSSTASDNNTQSVAKGSTSKPAN 263 

+++SYT+ + + TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPNPGQGHQPDNGGYHPAPPRPNDASQN 300 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFI 316 

K + + LL +L+ R+ E DGL+F-fP ++I G +PHGDHYH I 

Sbjct: 301 KHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHYHII 360 
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Query: 317 PYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKELSS 372 

P S+LS LE ++A R T + S 4-+KP+ EV + 

Sbjct: 361 PRSQLSPLEMELADRYLAGQTDDNDSGSDHSKPSDKEVTHTFLGHRIKAYGKGLDGKPYD 420 

Query: 373 ASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN- -NSLATPSPSLPINP 430 

SD Y+F+ 4- I + +HGDHFHYI ++ Q L N + + + 

Sbjct: 421 TSDAYVFSKESIHSVDKSGVTAKHGDHFHYI-GFGELEQYELDEVANWVKAKGQADELVA 479 

Query: 431 GISHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLE 4 86 

+ E+ +E FD + ++ + + G+IM ++++ + L QI A+ + L 

Sbjct: 480 ALDQEQGKEKPL-FDTKKVSRKVTKDGKVGYIMPKDGKDYFYARYQLDLTQIAFAEQELM 538 

Query: 4 87 EVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAI 546 

H D +D IE ++A + + + + + + 

Sbjct: 539 LKDKKHYRYD I VDTGIEPRLAVDVSSLPMHAGNATYDTGSSFV 581 

Query: 547 I YPHGDHHHAD P 558 

X PH DH H P 
Sbjct: 582 I-PHIDHIHWP 592 



Score = 47.8 bits (112), Expect = 0.001 

Identities - 53/313 (16%), Positives = 108/313 (33%), Gaps = 35/313 (11%) 

Query: 67 AEQ I VI KI TDQGYVTSHGDHYHY - - YNGKVP YD ALFSEELLM 106 

+ + + + + G HGDH+HY + Y+ A +E 

Sbjct: 428 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELVAALDQEQGK 4 87 

Query: 107 KDPNYQLKDAD- - IVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN 164 

+ P + K + + K GYI + DGK Y Y 4- + ++ + +KD 

Sbjct: 488 EKPLFDTKKVSRKVTKDGKVGYIMPKDGKDYFYARYQLDLTQIAFAEQ ELMLKDK 542 

Query: 165 E KVNSNVAVARS QGR YTTN - DGYVFNP AD 1 1 EDTGNAY I VPHGGH YH Y I PXXXXXXXXXX 223 

+ + + + R + + + DTG+++++PH H H +P 

Sbjct: 543 KHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRNQIA 602 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 283 

+ + p s + + + + +E+ + + R 

Sbjct: 603 TIKYV1VIQHPEVRPDWSKPGHEESGSVIPNVTPLDKRAGMPNWQIIHSAEEVQKALAEGR 662 

Query: 284 YSESDGLVFDPAKIISR TPNGVAI PH - - GDHYHF I PYS KLSALEEKI ARRVPI SG 336 

+ 4- DG +FDP 4- + + 4- +IP G I S LS E + A+ 4- 

Sbjct: 663 FAAPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 722 

Query: 3 37 TGSTVSTNAKPNE 34 9 

+ KP E 
Sbjct: 72 3 NAGDATDTDKPEE 73 5 



Score = 38.1 bits (87), Expect = 0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G I + +V+ T Y+ HGDH+H-f + +P + 

Sbjct: 62 SAEEG-ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+4-4- 4- 4- 4- K + 1+ Q+ E+V 



http : //us. expasy.org/cgi-bin/blast .pi 



9/27/04 



ExPASy BLAST2 Interface 



Page 69 of 116 



Sbjct: 102 ISEELLMTDPNYHFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAK 161 

Query: 489 KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 545 

+ GL + + ++ E K + + + + + + + +A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYI FS PTD I IDDLGDA 214 

Query: 546 IIYPHGDHHHADPIDEHKP 564 

+ PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHYIPKKDLSP 2 33 



tr Q99X V4 Hypothetical protein SPy2006 [SPy2006] [Streptococcus 825 

pyogenes] ^ 

align 

Score = 304 bits (779), Expect = 5e-81 

Identities = 196/614 (31%), Positives = 297/614 (47%), Gaps = 76/614 (12%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE - -NLTPDQ 58 

MK + YI + +A++++ + +Y L +H +N+++Y+D S+ K+ N T DQ 

Sbjct: 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGSATKDNQIAYIDDSKGKAKAPKTNKTMDQ 60 

Query: 5 9 VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 118 

+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY+ K +D+ 
Sbjct: 61 ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYRFKQSDV 120 

Query: 119 VNE VKGGYI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVA 172 

GY+IKV+G YYVYLK + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 INEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKE 180 

Query: 173 VARS QGRYTTNDGYVFNP AD 1 1 EDTGNAY I VPHGGH YHY IP XXXXXXXXXX 223 

A+ QGRYTT+DGY+F+P DII+D G-f AY+VPHG H YHY IP 
Sbjct: 181 E VAAVNE AKRQGR YTTDDG Y I F S PTD I IDDLGDAYL VPHGNH YH Y I PKKDL S P S EL AAAQ 240 

Query: 22 4 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPAN 2 63 

+ PS + + ++ TP 
Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPAPGRRKAPIPDVTPNPGQGHQPDNGGYHPAPPRPNDAS 300 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYH 314 

K + + LL +L+ R+ E DGL+F+P ++I G +PHGDHYH 

Sbjct: 3 01 QNKHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHYH 360 

Query: 315 FIPYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKEL 370 

IP S+LS LE ++A R T + S ++KP+ EV + 

Sbjct: 361 IIPRSQLSPLEMELADRYLAGQTEDDDSGSDHSKPSDKEVTHTFLGHRIKAYGKGLDGKP 420 

Query: 371 S SASDGYI FNPKD I VEETATAYI VRHGDHFHYI PKSNQIGQPTLPN - -NSLATPSPSLPI 428 

SD Y+F+ +1 + +HGDHFHY1 Q L N + + + 

Sbjct: 421 YDTSDAYVFSKESIHSVDKSGVTAKHGDHFHYI-GFGELEQYELDEVANWVKAKGQADEL 479 

Query: 42 9 NPGISHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKH 4 84 

4- E+ +E FD + ++ + + G++M + + + + + L QI A+ + 

Sbjct: 480 AAALDQEQGKEKPL-FDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQIAFAEQE 538 

Query: 4 85 LEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKN 544 

L H D +D IE ++A + + + + + 
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Sbjct: 53 9 LMLKDKKHYRYDI - 



- VDTGI EPRLAVDVS SLPMHAGNATYDTGS S 581 



Query: 545 AI I YPHGDHHHADP 55 8 

+1 PH DH H P 
Sbjct: 5 82 FVI-PHIDHIHWP 5 94 



Score = 49.7 bits (117), Expect = 3e-04 

Identities = 53/313 (16%), Positives = 109/313 (33%), Gaps = 35/313 (11%) 

Query: 67 AEQIVIKITDQGYVTSHGDHYHY- - YNGKVPYD ALFSEELLM 106 

+ + + +4- G HGDH+HY + Y+ A +E 

Sbjct: 430 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 489 

Query: 107 KDPNYQLKDAD- -IWEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN 164 

+ p + K + + K GY+ + DGK Y Y +D + ++ + +KD 

Sbjct: 490 EKPLFDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 544 

Query: 165 E KVNSNVAVARS QGR YTTN - DG YVFNPAD 1 1 EDTGNAY I VPHGGH YHY I PXXXXXXXXXX 223 

+ + + + r + + + DTG+++++PH H H +P 

Sbjct: 545 KHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 604 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 283 

++ p s + + + + +E+ + + R 

Sbjct: 605 T I KYVMQHPEVRPD I WS KPGHEESGS VI PNVTPLDKRAGMPNWQ I IHS AEE VQKALAEGR 664 

Query: 284 YSESDGLVFDPAKI ISR TPNGVAI PH - - GDHYHF I PYS KLS ALEEKI ARRVP I SG 336 

+ + DG +FDP 4- + + + +IP G I S LS E + A+ + 

Sbjct: 665 FATPDGYIFDPRDVLAKETFWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 724 

Query: 337 TGSTVSTNAKPNE 34 9 

4- KP E 
Sbjct: 725 NAGDATDTDKPKE 737 

Score = 38.1 bits (87), Expect = 0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G 1 + +V+ T Y+ HGDH+H+ + +P + 

Sbjct: 62 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+++ + + + K + 1+ Q+ E+V 

Sbjct: 102 ISEELLMTDPNYRFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAK 161 

Query: 489 KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRES I WNKEKNA 545 

+ GL ++ ++ E K + + + + + + + +A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 214 

Query: 546 I IYPHGDHHHADPIDEHKP 564 

4- PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHYIPKKDLSP 23 3 



tr Q8K5Q1 Histidine triad protein [SpyM3_1724] [Streptococcus pyogenes 823 AA 
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(serotype M3) ] 

al ign 

Score = 304 bits (778), Expect = 7e-81 

Identities = 195/612 (31%), Positives = 295/612 (47%), Gaps = 74/612 (12%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE- -NLTPDQ 58 

MK + YI + +A++++ + +Y L +H +N+++Y+D S+ K+ NT DQ 

Sbjct: 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGLATKDNQIAYIDDSKGKAKAPKTNKTMDQ 60 

Query: 5 9 VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 118 

+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY K +D+ 
Sbjct: 61 ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYHFKQSDV 120 

Query: 119 VNE VKGGYI I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVA 172 

+NE+ GY+IKV+G YYVY+K + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 I NE I LDG YVI KVNGN Y YVY I KPGS KRKN I RTKQQ I AEQ VAKGTKE AKE KGL AQ VAHL S KE 180 

Query: 17 3 VARS QGRYTTNDG YVFNP AD I I EDTGNAY I VPHGGH YH Y I PXXXXXXXXXXXXX 22 6 

A+ QGRYTT+DGY+F+P DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 EVAAVNEAKRQGRYTTDDGYIFSPTDIIDDLGDAYLVPHGNHYHYIPKKDLSPSELAAAQ 240 

Query: 227 XXXXXXNMQPSQLS - YS STASDNNTQS VAKGSTS KPAN 263 

-f ++ S Y T 4- ++ TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPNPGQGHQPDNGGYHPAPPRPNDASQN 300 

Query: 264 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFI 316 

K + + LL +L+ R+ E DGL+F+P ++I G +PHGDHYH I 

Sbjct: 301 KHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHYHII 360 

Query: 317 PYSKLSALEEKIARRVPISGTGSTVS TNAKPNEWXXXXXXXXXXXXXXXXKELSS 3 72 

P S+LS LE + +A R T S + + EV + 

Sbjct: 361 PRSQLSPLEMELADRYLAGQTEDNDSGSDHSKSSDKEVTHTFLGHRIKAYGKGLDGKPYD 420 

Query: 3 73 ASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN- -NSLATPSPSLPINP 43 0 

SD Y+F+ +1 + +HGDHFHYI ++ Q L N + + + 

Sbjct: 421 TSDAYVFSKESIHSVDKSGVTAKHGDHFHYI-GFGELEQYELDEVANWVKAKGQADELAA 479 

Query: 4 31 GISHEKHEEDGYGFD ANRI I AEDESGF IMSHGNHNHYFFKKDLTEEQ I KAAQKHLE 4 86 

+ E+ +E FD + ++ + + G+IM ++++ + L QI A+ + L 

Sbjct: 480 ALDQEQGKEKPL-FDTKKVSRKVTKDGKVGYIMPKDGKDYFYARDQLDLTQIAFAEQELM 538 

Query: 487 EVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAI 546 

+ H D IS ++A + + + + + + 

Sbjct: 539 LKDKNHYRYD I VDTGIEPRLAVDVSSLPMHAGNATYDTGSSFV 581 

Query: 547 IYPHGDHHHADP 558 

I PH DH H P 
Sbjct: 582 I-PHIDHIHWP 592 

Score =50.8 bits (120), Expect = le-04 

Identities = 54/313 (17%), Positives = 108/313 (34%), Gaps = 35/313 (11%) 

Query: 67 AEQIVIKITDQGYVTSHGDHYHY- -YNGKVPYD ALFSEELLM 106 

+++ + + G HGDH+HY + Y+ A +E 

Sbjct: 428 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 487 
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Query: 107 KDPNYQLKDAD- -IWEVKGGYIIKVDGKYYVYLKDAAHADWRTKDEINRQKQEHVKDN 164 

+ p + k + + K GYI+ DGK Y Y +D + ++ + 
Sbjct: 488 EKPLFDTKKVSRKVTKDGKVGYIMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 542 

Query: 165 EKVNSNVAVARSQGRYTTN-DGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXX 223 

+ + + R + + + DTG4- + + + + PH H H +P 

Sbjct: 543 NHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 602 

Query: 2 24 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 2 83 

++ p s 4- + + + +E + 4- + R 

Sbjct: 603 T I KYVMQHPE WPDVWS KPGHEESGS VI PNVTPLDKRAGMPNWQ I IHS AEEVQKALAEGR 662 

Query: 284 YSESDGLVFDPAKI I SR TPNGVAI PH - - GDHYHF I PYSKLS ALEEKI ARRVP I SG 336 

+ 4- DG + FDP + + + + +IP G I S LS E + A+ + 

Sbjct: 663 FATPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 722 

Query: 337 TG S TVS TNAKPNE 34 9 

+ KP E 
Sbjct: 723 NAGDATDTDKPKE 735 

Score =38.1 bits (87), Expect =0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G I + +V+ T Y+ HGDH+H4- + + P + 

Sbjct: 62 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED -GYGFDANRI IAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 4 88 

IS E D Y F 4- +1 E G+ + + + 4- + K + 1+ Q+- E+V 

Sbjct: 102 I S EELLMTDPNYHFKQSDVINE I LDGYVI KVNGNYYVYI KPGS KRKNIRTKQQI AEQVAK 161 

Query: 489 - - -KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 545 

+ GL ++ ++ E K + 4- + + + + + +A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 214 

Query: 54 6 I I YPHGDHHHADPIDEHKP 564 

+ PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHY I PKKDLS P 233 



tr Q 93GT5 Histidine triad protein of group A streptococci 825 AA 

[htpA] alASIi 
[Streptococcus pyogenes] 



Score = 302 bits (774) , Expect = 2e-80 

Identities = 195/614 (31%), Positives = 297/614 (47%), Gaps = 76/614 (12%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE- -NLTPDQ 58 

MK 4- YI + +A4- + + + + 4-Y L +H +N+++Y+D S+ K+ N T DQ 

Sbjct: 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGSATKDNQIAYIDDSKGKAKAPKTNKTMDQ 6 0 

Query: 59 VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 118 

+ S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY+ K +D4- 
Sbjct: 61 ISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYRFKQSDV 120 
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Query: 119 VNEVKGGY 1 1 KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVA 172 

GY+IKV+G YYVYLK + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 INEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKE 180 

Query: 173 VARS QGR YTTNDGY VFNP AD I I EDTGNAY I VPHGGHYH Y IP XXXXXXXXXX 223 

A+ QGRYTT+DGY+F+P DII+D G+AY4-VPHG HYHYIP 
Sbjct: 181 EVAAVNEAKRQGRYTTDDGYIFSPTDIIDDLGDAYLVPHGNHYHYIPKKDLSPSELAAAQ 240 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPAN 263 

+ PS + + TP 

Sbjct: 241 AYWSQKQGRGARPSDYRPTPAPAPGRRKAP I PDVTPNPGQGHQPDNGGYHPAPPRPNDAS 300 

Query: 2 64 KSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYH 314 

K + + LL +L+ R+ E DGL-f F-+-P + + I G +PHGDH+H 

Sbjct: 301 QNKHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLIFEPTQVIKSNAFGYWPHGDHHH 360 

Query: 315 FIPYSKLSALEEKIARRVPISGT- -GSTVSTNAKPN- -EWXXXXXXXXXXXXXXXXKEL 370 

IP S+LS LE ++A R T + S ++KP+ EV + 

Sbjct: 361 UPRSQLSPLEMELADRYLAGQTEDDDSGSDHSKPSDKEVTHTFLGHRIKAYGKGLDGKP 420 

Query: 371 SSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN- -NSLATPSPSLPI 428 

SD Y+F+ +1 + +HGDHFHYI ++ Q L N + + + 

Sbjct: 421 YDTSDAYVFSKES IHSVDKSGVTAKHGDHFHYI - GFGELEQYELDEVANWVKAKGQADEL 479 

Query: 42 9 NPGI SHEKHEEDGYGFD ANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKH 4 84 

+ E+ +E FD + ++ + + G++M ++++ + L QI A++ 

Sbjct: 480 AAALDQEQGKEKPL - FDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQI AFAEQE 538 

Query: 4 85 LEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKN 544 

L H D IE ++A + + + + + 

Sbjct: 539 LMLKDKKHYRYD I VDTGI EPRLAVDVS S LPMHAGNAT YDTGS S 581 

Query: 545 AI I YPHGDHHHADP 55 8 

+1 PH DH H P 
Sbjct: 582 FVI-PHIDHIHWP 594 



Score = 49.7 bits (117), Expect = 3e-04 

Identities = 53/313 (16%), Positives = 109/313 (33%), Gaps = 35/313 (11%) 

Query: 67 AEQI VI KITDQGYVTSHGDHYHY - - YNGKVPYD ALFSEELLM 106 

+++ +4- G HGDH+HY + Y+ A +E 

Sbjct: 430 SKESIHS VDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 489 

Query: 107 KDPNYQLKDAD- - IVNEVKGGY I I KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKDN 164 

+ p + K + + K GY++ DGK Y Y +D + ++ + +KD 

Sbjct: 490 EKPLFDTKKVSRKVTKDGKVGYMMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 544 

Query: 165 EKVNSNVAVARSQGRYTTN-DGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXX 223 

+ + + + R + + + DTG+++++PH H H +P 

Sbjct: 545 KHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 604 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 283 

+ + p s + + + + +E+ + + R 

Sbjct: 605 TI KYVMQHPE VRPD I WS KPGHEE SGSVI PNVTPLDKRAGMPNWQ I IHS AEEVQKALAEGR 664 

Query: 284 YSESDGLVFDPAKIISR TPNGVAI PH - - GDHYHF I P YS KLS ALEEKI ARRVP I SG 336 

+ + DG +FDP ++++ +IP G I S LS E + A+ + 
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Sbjct: 665 FATPDGYIFDPRDVLAKETFVWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 724 

Query: 3 37 TGS T VS TNAKPNE 34 9 

4- KP E 
Sbjct: 725 NAGDATDTDKPKE 737 

Score =38.1 bits (87), Expect =0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 4 31 

SA +G I + +V+ T Y+ HGDH+H+ + +P + 

Sbjct: 62 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+++ + + + K + 1+ Q+ E+V 

Sbjct: 102 ISEELLMTDPNYRFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAK 161 

Query: 489 - - -KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNA 545 

+ GL ++ ++ E K + + + + + + + + A 

Sbjct: 162 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 214 

Query: 546 IIYPHGDHHHADPIDEHKP 564 

+ PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHY I PKKDLS P 233 



tr Q8E338 Hypothetical protein gbsl925 [gbsl925] [Streptococcus 481 
agalactiae AA i 
(serotype III)] align 



Score = 301 bits (771) , Expect = 4e-80 

Identities = 163/380 (42%), Positives = 221/380 (57%), Gaps = 57/380 (15%) 

Query: 8 IAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE NLTPDQVSQKE 63 

I+A S + + AY L + +Q K +N VSYV +S K++ N TPDQ+S++E 

Sbjct: 9 ISALSVALFGTGVGAYQLGSYNAQ- -KSDNSVSYVKTDKSDSKAQATAVNKTPDQISKEE 66 

Query: 64 GIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVK 123 

GI AEQIV+KITD GYVTSHGDHYHYYNGKVPYDA+ SEEL+MKDP+Y AD++NEVK 
Sbjct : 67 GISAEQIWKITDDGYVTSHGDHYHYYNGKVPYDAIISEELIMKDPSYVFNKADVINEVK 126 

Query: 124 GGYIIKVX)GKYYVYLKDAAHADNVRTKDEINRQKQEHVKDNEK VNSN 170 

GYIIKV+GKYY+YLK+ + NVRTK++I +Q++E K K ++++ 
Sbjct: 127 DG YI I KVNGKYYL YLKEGS KRTNVRTKEQI QKQREEWS KGGSKGE SGKHS S AKTQALS AS 186 

Query: 171 VAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXX 225 

V A++ GRYTT+DGYVF+P D+I+D G+A++VPHG H+HYIP 
Sbjct: 187 VREAKASGRYTTDDGYVFSPTDVIDDMGDAFLVPHGDHFHYIPKADLSPSELSAAQAYWN 246 

Query: 226 -XXXXXXXNMQPSQLS YSSTASDNNTQS 252 

+ +PS S +S A ++ T 

Sbjct: 247 RKTGRS GNS S KP SNS S S Y I HAS APS GNVS TGRHANAP I S I PRVTHANHW S KP AGNHATAP 306 

Query: 253 VAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDH 312 

T+KP NK L +LK LY P R+ ESDGLV+DPA+ 4- + T GV+IPHG+H 
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Sbjct: 307 KHHAPTTKP INKD S ALD KMLKRL YAQPL YARHVE S DGL VYD PAQVNAFTAI GVS I PHGNH 366 

Query: 313 YHFIPYSKLSALEEKIARRV 3 32 

+HFI Y +S LE + R V 
Sbjct: 3 67 FHFIHYKDMSPLELEATRMV 3 86 



Score = 74.7 bits (182), Expect = 9e-12 

Identities = 74/304 (24%), Positives = 124/304 (40%), Gaps = 40/304 (13%) 

Query: 18 6 YVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTA 24 5 

YVFN AD+I + + YI+ G Y+ + +s 

Sbjct: 114 YVFNKADVINE VKDGY I I KVNGKYYL Y LKEGSKRTNVRTKEQIQKQREEWSKGG 167 

Query: 24 6 SDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGV 305 

S KG + K + S Q+L + + + ++ RY-f DG VF P +1 + 

Sbjct: 168 S KGE S GK - - HS S AKTQAL S AS VRE AKAS GR YTTDDGYVF S PTD VI DDMGD AF 217 

Query: 3 06 AIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXX 3 65 

+PHGDH+H+IP + LS E A+ TG + + +KP+ 
Sbjct: 218 LVPHGDHFHYIPKADLSPSELSAAQAYWNRKTGRS-GNSSKPSN- SSSYIH 266 

Query: 366 XXKELS S ASDGYI FN - PKD I VEET - ATAYI VRHGDHF HYIPKSNQIGQPTLPNNSL 419 

+ SG NPI T A + G+H H+ P + I + 
Sbjct: 267 ASAP S GNVS TGRHANAP I S I PRVTHANHWS KPAGNHATAPKHHAPTTKP INK DSAL 322 

Query: 420 ATPSPSLPINPGISHEKH-EEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQI 478 

L P + +H E DG +D ++ A G + HGNH H+ KD++ ++ 
Sbjct: 323 DKMLKRLYAQP- - L YARHVE S DGL VYD PAQVNAFTAI GVS I PHGNHFHFIHYKDMSPLEL 380 

Query: 479 KAAQ 482 

+A + 

Sbjct: 3 81 EATR 3 84 
Score =57.8 bits (138), Expect = le-06 

Identities = 59/293 (20%), Positives = 112/293 (38%), Gaps = 83/293 (28%) 

Query: 24 0 SYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIIS 2 99 

SY++ SDN+ V K++ S + + + + S+ +G+ + + + 

Sbjct: 28 SYNAQKSDNSVSYV KTDKSDS KAQATAVNKTPDQ I S KEEGI S AEQ I - WK 76 

Query: 300 RTPNGVAI PHGDHYHF IPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXX 355 

T +G HGDHYH+ +PY + + 
Sbjct: 77 I TDDGYVTSHGDHYHYYNGKVP YDAI I S 104 

Query: 356 XXXXXXXXXXXXKELS S ASDG YI FNPKD I VEETATAY I VR - HGDHFHY I PKSN 407 

+EL Y+FN D+4- E YJ + 4- +G + + Y+ + + 

Sbjct: 105 EEL IMKDPS YVFNKADVINE VKDGY 1 1 KVNGKYYLYLKEGSKRTNVRT 152 

Query: 408 --QIGQP TLPNNSLATPSPSLP INPGI SHEKHE - EDGYGFDANRI I AE 452 

QI + + + + S T + S + + ++ +DGY F +1 + 

Sbjct: 153 KEQIQKQREEWSKGGSKGESGKHSSAKTQALSASVREAKASGRYTTDDGYVFSPTDVIDD 212 

Query: 4 53 DESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDY 505 

F++ HG+H HY K DL+ + + AAQ + KT +G S S+ Y 
Sbjct: 213 MGDAFLVPHGDHFHYIPKADLSPSELSAAQAYWNR-KTGRSGNSSKPSNSSSY 264 
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Score = 41.6 bits (96), Expect = 0.080 

Identities = 54/216 (25%), Positives = 84/216 (38%), Gaps = 39/216 (18%) 

Query: 3 84 IVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEED-GY 442 

+V+ T Y+ HGDH+HY + +P + IS E +D Y 

Sbjct: 74 WKI TDDGYVTSHGDHYH YY NGKVPYDAIISEELIMKDPSY 114 

Query: 443 GFDANRI IAEDESGF IMSHGNHNHYFFKKD LTEEQI KAAQKHLEEVKTSHNGL 495 

F+ +1 E + G + I + N +Y + R+ T+EQI QK EE S G 
Sbjct: 115 VFNKADVINEVKDGYIIK-VNGKYYLYLKEGSKRTNVRTKEQI QKQREE- -WSKGG- 167 

Query: 496 DSLSSHEQDYPGNAKEMKDLDKKIEEKIAG- - IMKQYGVKRESI VWKEKNAIIYPHGDH 553 

S E +AK E K +G V + V++ +A + PHGDH 

Sbjct: 168 SKGESGKHSSAKTQALSASVREAKASGRYTTDDGYVFSPTDVIDDMGDAFLVPHGDH 224 

Query: 554 HHADPIDEHKPVGIGHSHSNYELFKPEEGVAKKEGN 589 

HP + P + + + + G + KM 

Sbjct: 225 FHYIPKADLSPSELSAAQAYWNRKTGRSGNSSKPSN 260 



tr Q877Y2 Hypothetical protein SPsl722 [SPS1722] [Streptococcus pyogenes 794 AA 
(serotype M3) ] 

align 

Score = 291 bits (746) , Expect = 3e-77 

Identities = 187/577 (32%), Positives = 277/577 (47%), Gaps = 74/577 (12%) 

Query: 3 6 NNRVSYVDGSQSSQKSE - -NLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGK 93 

4-N+ ++Y+D S+ K+ NT DQ+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGK 
Sbjct: 7 DNQIAYIDDSKGKAKAPKTNKTMDQISAEEGISAEQIWKITDQGYVTSHGDHYHFYNGK 66 

Query: 94 VP YD AL F S EELLMKD PN YQLKDAD I VNE VKGG Y 1 1 KVDGKYYVYLKD AAHADNVRTKDE I 153 

VPYDA+ SEELLM DPNY K +D++NE+ GY+IKV+G YYVY+K + N+RTK +1 
Sbjct: 67 VPYDAIISEELLMTDPNYHFKQSDVINEILDGYVIKVNGNYYVYIKPGSKRKNIRTKQQI 126 

Query: 154 NRQKQEHVKD -NEKVNSNVA VARSQGRYTTNDGYVFNPADI IEDTGNAY 201 

Q + K+ EK + VA A+ QGRYTT+DGY+F+P DII+D G+AY 

Sbjct: 127 AEQVAKGTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTDDGYIFSPTDIIDDLGDAY 186 

Query: 2 02 IVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLS-YSSTASDNNTQSVAKGSTSK 2 60 

+VPHG HYHYIP +++SYT+ ++ T 

Sbjct: 187 LVPHGNHYHYIPKKDLSPSELAAAQAYWSQKQGRGARPSDYRPTPAPGRRKAPIPDVTPN 246 

Query: 261 PAN KSENLQSLLKELYDSPSAQRYSESDGLV 2 91 

p K + + LL +L+ R+ E DGL+ 

Sbjct: 247 PGQGHQPDNGGYHPAPPRPNDASQNKHQRDEFKGKTFKELLDQLHRLDLKYRHVEEDGLI 306 

Query: 2 92 FDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVPISGTGSTVS TNAKP 347 

F4 .p + + I G +PHGDHYH IP S+LS LE ++A R T S + + 

Sbjct: 3 07 FEPTQVIKSNAFGYWPHGDHYHI IPRSQLSPLEMELADRYLAGQTEDNDSGSDHSKSSD 3 66 

Query: 34 8 NEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSN 407 

EV 4- SD Y+F+ + 1 + +HGDHFHYI 

Sbjct: 367 KEVTHTFLGHRIKAYGKGLDGKPYDTSDAYVFSKESIHSVDKSGVTAKHGDHFHYI -GFG 425 
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Query: 408 QIGQPTLPN- -NSLATPSPSLPINPGISHEKHEEDGYGFD ANRIIAEDESGFIMSH 461 

+ + Q L N 4- 4-4- + E + +E FD + 4-4- + + G+IM 

Sbjct: 426 ELEQYELDEVANWVKAKGQADELAAALDQEQGKEKPL - FDTKKVSRKVTKDGKVGYIMPK 484 

Query: 462 GNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEE 521 

++ + + + L QI A++ L +H D +D IE 

Sbjct: 485 DGKDYFYARDQLDLTQIAFAEQELMLKDKNHYRYDI VDTGIEP 527 

Query: 522 KIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADP 558 

++A + + + + 4- +1 PH DH H P 
Sbjct: 528 RLAVD VS S L PMHAGNAT YDTGS S F V I - PH IDH I H WP 563 



Score =50.8 bits (120), Expect = le-04 

Identities = 54/313 (17%), Positives = 108/313 (34%), Gaps = 35/313 (11%) 

Query: 67 AEQIVIKITDQGYVTSHGDHYHY- -YNGKVPYD ALFSEELLM 106 

4-4-4- + + G HGDH+HY + Y+ A 4-E 

Sbjct: 399 SKESIHSVDKSGVTAKHGDHFHYIGFGELEQYELDEVANWVKAKGQADELAAALDQEQGK 458 

Query: 107 KDPNYQLKDAD- -IWEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVKDN 164 

+ P + K + + K GYI+ DGK Y Y 4-D + 4-4- + 4-KD 

Sbjct: 459 EKPLFDTKKVSRKVTKDGKVGYIMPKDGKDYFYARDQLDLTQIAFAEQ ELMLKDK 513 

Query: 165 EKVNSNVAVARSQGRYTTN-DGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXX 223 

+ + + R + + + DTG+ + 4- ++PH H H +P 

Sbjct: 514 NHYRYDIVDTGIEPRLAVDVSSLPMHAGNATYDTGSSFVIPHIDHIHWPYSWLTRDQIA 573 

Query: 224 XXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQR 283 

4- + p S + + + 4- 4-E + 4-4- R 

Sbjct: 574 T I KYVMQHPE VRPDVWS KP GHEES GS VI PNVTPLDKRAGMPNWQ I IHS AEEVQKALAEGR 633 

Query: 2 84 YSESDGLVFDPAKI ISR TPNGVAI PH - - GDHYHF I PYS KLS ALEEKI ARRVP I SG 33 6 

+ 4- DG 4-FDP + + + 4- +IP G I S LS E 4- A+ 4- 

Sbjct: 634 FATPDGYIFDPRDVLAKETFWKDGSFSIPRADGSSLRTINKSDLSQAEWQQAQELLAKK 693 

Query: 33 7 TGSTVSTNAKPNE 34 9 

4- KP E 
Sbjct: 694 NAGDATDTDKPKE 706 



Score = 38.1 bits (87), Expect = 0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 3 72 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA 4-G I + 4-V+ T Y+ HGDH+H+ 4- +P + 

Sbjct: 33 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 72 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+4-+ 4- + 4- K + 1+ Q+ E+V 

Sbjct: 73 ISEELLMTDPNYHFKQSDVINEILDGYVIKVNGNYYVYIKPGSKRKNIRTKQQIAEQVAK 132 

Query: 489 - - -KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 545 

4- GL ++ ++ E K + + 4-4- 4-4-4- +A 

Sbjct: 133 GTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 185 

Query: 546 IIYPHGDHHHADPIDEHKP 564 

4- PHG+H+H P + P 
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Sbjct: 186 YL VPHGNH YH Y I PKKDL S P 204 



tr Q9AE2 1 Hypothetical protein (Fragment) [Streptococcus agalactiae] 289 AA 

align 



Score = 241 bits (614), Expect = 7e-62 

Identities = 117/227 (51%), Positives = 159/227 (69%), Gaps = 14/227 (6%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSE- -NLTPDQ 58 

MK + YI + +A4- +++ + +Y L +H +N+ ++Y+D S+ K+ NT DQ 

Sbjct : 1 MKKTYGYIGSVAAILLATHIGSYQLGKHHMGLATKDNQIAYIDDSKGKVKAPKTNKTMDQ 60 

Query: 59 VSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADI 118 

+S +EGI AEQIV+KITDQGYVTSHGDHYH+YNGKVPYDA+ SEELLM DPNY+ K +D+ 
Sbjct : 61 ISAEEGISAEQI WKITDQGYVTSHGDHYHFYNGKVPYDAIISEELLMTDPNYRFKQSDV 120 

Query: 119 VNEVKGGY 1 1 KVDGKYYVYLKDAAHADNVRTKDE INRQKQEHVKD - NEKVNSNVA 172 

+NE + GY+IKV+G YYVYLK + N+RTK +1 Q + K+ EK + VA 
Sbjct: 121 INEILDGYVIKVNGNYYVYLKPGSKRKNIRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKE 180 

Query: 173 VARS QGRYTTNDG YVFNPAD 1 1 EDTGNAY I VPHGGHYH Y I P 213 

A+ QGRYTT+DGY+F+P DII+D G+AY+VPHG HYHYIP 
Sbjct: 181 E VAAVNE AKRQGRYTTDDGYI FS PTD I IDDLGDAYLVPHGNHYH YI P 227 



Score = 56.2 bits (134), Expect = 3e-06 

Identities = 33/94 (35%), Positives = 48/94 (50%), Gaps = 24/94 (25%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPK SNQIGQPTLPN 416 

+ DGYIF+P DI + + + AY4-V HG+H+HYIPK S + G+ P+ 

Sbjct: 195 TTDDGYIFSPTDIIDDLGDAYLVPHGNHYHYIPKKDLSPSELAAAQAYWSQKQGRGARPS 254 

Query: 417 NSLATPSP SLPI NPGISHEKHEEDGY 442 

+ TP + P PI NPG H+ + GY 

Sbjct: 255 DYRPTPAPGRRKAP I PDVTPNPGQGHQP - DNGGY 287 



Score = 49.7 bits (117), Expect = 3e-04 

Identities = 45/214 (21%), Positives = 74/214 (34%), Gaps = 70/214 (32%) 

Query: 2 97 IISRTPNGVAIPHGDHYHF 1 P YS KLS ALEEKI ARRVP I SGTGSTVSTNAKPNE WX 3 52 

+ + T G HGDHYHF +PY + + 
Sbjct: 73 WKITDQGYVTSHGDHYHFYNGKVPYDAI IS 103 

Query: 353 XXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVR-HGDHFHYIP KS 406 

+EL Y F D + + E Y + + + Y+ K+ 

Sbjct: 104 EELLMTDPNYRFKQSDVINEILDGYVIKVNGNYYVYLKPGSKRKN 148 

Query: 407 NQIGQPTLPNNSLATPSPSLPINPGISHEKHEE- DGYGFDANRI I 450 

+ Q + T ++H EE DGY F II 

Sbjct: 149 IRTKQQIAEQVAKGTKEAKEKGLAQVAHLSKEEVAAVNEAKRQGRYTTDDGYIFSPTDII 208 

Query: 451 AEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKH 4 84 

+ +++ HGNH HY KKDL+ ++ AAQ + 
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Sbjct: 2 09 DDLGD A YL VPHGNH YH Y I P KKD L S P S E L AAAQ AY 242 



Score = 38.1 bits (87), Expect = 0.88 

Identities = 39/199 (19%), Positives = 78/199 (38%), Gaps = 33/199 (16%) 

Query: 372 SASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPG 431 

SA +G I + +V+ T Y+ HGDH+H+ 4- +P + 

Sbjct: 62 SAEEG- ISAEQIWKITDQGYVTSHGDHYHFY NGKVPYDAI 101 

Query: 432 ISHEKHEED-GYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV- - 488 

IS E D Y F + +1 E G+ + + + + + K + 1+ Q+ E+V 

Sbjct: 102 I S EELLMTDPNYRFKQSDVINE I LDGYVI KVNGNYYVYLKPGS KRKNIRTKQQ I AEQVAK 161 

Query: 489 KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNA 54 5 

+ GL ++ ++ E K + + + + + + + +A 

Sbjct: 162 GTKEAKEKGLAQVAHLS KEE VAAVNEAKRQGRYTTD DGYIFSPTDI IDDLGDA 214 

Query: 546 IIYPHGDHHHADPIDEHKP 564 

+ PHG+H+H P + P 
Sbjct: 215 YLVPHGNHYHYIPKKDLSP 233 



t r Q8DQ06 Pneumococcal histidine triad protein E, truncation 182 AA 

[phtE- truncation] [Streptococcus pneumoniae (strain align 
ATCC 

BAA-255 / R6) ] 



Score = 121 bits (304) , Expect = 6e-26 

Identities = 67/156 (42%), Positives = 91/156 (57%), Gaps = 16/156 (10%) 

Query: 370 LSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPN-NSLATPSPSLPI 428 

+S+ DGY+FNP DIV +T AYIVRHGDH+HYIPKS+ P+ N + + S S+ 
Sbjct : 9 VS AVX)DGYVFNPND I VRDTGDAYI VRHGDH YHYI PKS SLRNPPSHSNTEE VGS S S S S VLS 68 

Query: 42 9 NPGISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEV 4 88 

NP + EEDG+GFDANRI I +ED GF + + HG+HNHY IK K E 

Sbjct: 69 NP S LHVHHE E EDGHGFDANR IIS ED S EGF V I PHGDHNHY I KVQTKG YE - - 116 

Query: 4 89 KTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIA 524 

N + SL S +YP + K + +A 
Sbjct: 117 AALKNKIPSLQS NYPPGTFDEKAVLAKVDQLLA 149 



Score = 87.4 bits (215), Expect = le-15 

Identities = 54/161 (33%), Positives = 74/161 (45%), Gaps = 39/161 (24%) 

Query: 173 VARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXX 232 

V + G +DGYVFNP DI+ DTG+AYIV HG HYHYIP 
Sbjct: 3 VTPNNGVSAVDDGYVFNPNDIVRDTGDAYIVRHGDHYHYIP 4 3 

Query: 2 33 NMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVF 2 92 

+ S ++ S +NT+ V S + S +N S +4- + E DG F 

Sbjct: 44 KSSLNNPPSHSNTEEVGSSSSSVLSNPSLHV HHEEEDGHGF 84 

Query: 293 DPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARRVP 333 
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D +IIS G IPHGDH H+I + E + ++P 

Sbjct: 85 DANR I I SEDSEGFVI PHGDHNHYI KV - QTKGYEAALKNKI P 124 



Score =46.6 bits (109), Expect = 0.002 

Identities = 32/135 (23%), Positives = 58/135 (42%), Gaps = 34/135 (25%) 

Query: 426 LPINP- -GISHEKHEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQK 483 

+P+ P G+S +DGY F+ N 1+ + +1+ HG+H HY K L + 
Sbjct : 1 MPVTPNNGVSAV DDGYVFNPND I VRDTGDAYI VRHGDHYHYI PKSSLNNPP SHS 54 

Query: 484 HLEEVKTSHNGL DSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWN 540 

+ EEV +S + + SL H ++ G+ + + + + 

Sbjct: 55 NTEE VGS S S S S VLSNPS LHVHHEEEDGHGFDANR IIS 91 

Query: 541 KEKNAI I YPHGDHHH 555 

++ + PHGDH+H 

Sbjct: 92 EDS EGFVI PHGDHNH 10 6 



tr Q8E029 Hypothetical protein SAG0907 [SAG0907] [Streptococcus 877 
agalactiae AA 
(serotype V)] al ign 



Score = 111 bits (277) , Expect = 8e-23 

Identities = 109/406 (26%), Positives = 170/406 (41%), Gaps = 59/406 (14%) 

Query: 250 TQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDG-LVFDPAKIISRTPNGVAIP 308 

T+ VAK K ANK + + + + DG L+ +KI+S T +G+ + 

Sbjct: 36 TKQVAK KKANKKKVSVKESHKKRKGVAGVDFPTDDGFLLTKDSKILSHTDSGIWA 91 

Query: 309 HGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXK 368 

HG+H HFI YS L K + +P +T N+ V 
Sbjct: 92 HGNHSHFIFYSDLKG- -SKFSYLIP NANT KTNKNQ AVRNF KAGAVA 135 

Query: 369 ELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPI 428 

+++ +DGY+FNP DIV E A Y+VRHGDHFHYIPK++ Q + A+ + S 

Sbjct: 136 - VNTLNDG YVFNP AD I VS ED ANG Y WRHGDHFH Y I PKAS L S QQKQVQ ASRAVSRLG 190 

Query: 429 NPGISHEK HEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQ 477 

N SH + +G+ F+ I +G H NH H+ DL + 

Sbjct: 191 NQSNSHYRVNS S KI AGLHYPTSNGFLFNGRG I KGTTPTGI LVEHHNHLHF I S FADLRKGG 250 

Query: 478 IKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI 537 

+ + K + + S SS + + LK I++K+A + + I 

Sbjct: 251 WGS IADRYQPQKKADSKKQSPSSKK PRTENTLPKD I KDKLAYLARELHLD I SRI 304 

Query: 538 WN KEKNAI I YPHGDHHH AD P I DEHKP VG I GH SHSNYELFKPEEGV 583 

V K N I YPH DH H A ID KP+ H H + E E 

Sbjct: 305 RVLKTLNGEIGFEYPHDDHTHVIMAKDIDLSKPIPNPHHDDEDHHKGHHHDESDHKHEEH 364 

Query: 584 AKKEGNKVYTGEELTNVVNLLKNSTFN-NQNFTLANGQKRVSFSFP 62 8 

+ NK+ + E+ ++ L + N NQ L + + F +P 
Sbjct: 365 EHTKSNKL - SDEDQKKLI YLAEKLGLNPNQIEVLTSEDGS I IFKYP 409 
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Score = 100 bits (250) , Expect = le-19 

Identities = 107/468 (22%) , Positives = 192/468 (40%) , Gaps = 92/468 (19%) 

Query: 13 9 KDAAHADI^RTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNP-ADIIEDT 197 

K+ + H + + +K+ VK++ K VA + T+DG++ + 1+ T 

Sbjct: 29 KNSHHIKTKQVAKKKANKKKVSVKESHKKRKGVAGVD FPTDDGFLLTKDSKILSHT 84 

Query: 19 8 GNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGS 2 57 

+ +VHGH H+I + + + S + SY + + T 
Sbjct: 85 DSGIWAHGNHSHFI FYSDLKGSKFSYLIPNANTKT 120 

Query: 258 TSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIP 317 

NK+ + ++ + + + + + +DG VF+PA I+S NG + HGDH+H+IP 

Sbjct: 121 NKNQAVRN FKAGAVAVNTLNDG YVFNPAD I VS EDANG Y WRHGDHFH Y I P 170 

Query: 318 YSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSASDGY 377 

+ LS + + A R +S G+ ++ + + N S+G + 

Sbjct: 171 KASLSQQKQVQASRA-VSRLGNQSNSHYRVNSSKIAGLHY PTSNGF 215 

Query: 378 I FNP KD I VEE TATAY I VRHGDH FH Y I PKSNQIGQPTLPNNSL 419 

+ FN +1 T T +V H +H H+I P+ + P + + 

Sbjct: 216 LFNGRGIKGTTPTGILVEHHNHLHFISFADLRKGGWGSIADRYQPQKKADSKKQSPSSKK 275 

Query: 420 ATPS PSLP - - INPGI SHEKHEEDGYGFDANRI IAEDESGFIMSHGNHNHYFFKKDL 473 

+LP I + + + E D +RI E GF H +H H KD+ 

Sbjct: 276 PRTENTLPKDIKDKLAYLARE LHLD I SRI RVLKTLNGE IGFE YPHDDHTHVIMAKD I 332 

Query: 474 T-EEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGV 532 

+ 1 E+ H+ +S HE + + ++ D D+K K+ + ++ G+ 
Sbjct: 333 DLSKPIPNPHHDDEDHHKGHHHDESDHKHEEHEHTKSNKLSDEDQK KLIYLAEKLGL 389 

Query: 533 KRESI - WNKEKNAII - -YPHGDHHH ADPIDEHKPVGIG-HSHSN 573 

I V+ E +11 YPH DH H +1+ KP+ G H HS + 
Sbjct: 390 NPNQIEVLTSEDGSIIFKYPHDDHSHTIASKDIEIGKPIPDGHHDHSH 437 



Score = 52.8 bits (125), Expect = 3e-05 

Identities = 70/346 (20%), Positives = 120/346 (34%), Gaps = 84/346 (24%) 

Query: 21 C AYALNQHRS QENKDNNR VS Y VDGSQSSQKSENLTPDQVSQKEG- - IQ 66 

C +AL +QE+K+++ + V +S +K + + +G + 

Sbjct : 16 CHFALTACHTQEHKNSHHIKTKQVAKKKANKKKVSVKESHKKRKGVAGVDFPTDDGFLLT 75 

Query: 67 AEQ I VI KI TDQG YVTSHGDHYHYYNGKVP YDALFSEELLMKDPN 110 

+ + + TD G V +HG+H H+ + Y L + PN 

Sbjct: 76 KDSKILSHTDSGIVVAHGNHSHF IFYSDLKGSKFSYLIPNANTKTNKNQAVRNFKA 131 

Query: 111 YQLKDADIVNEVKGGYI IKVDGKYYVYLKDAAHADNV RTKDEIN 154 

Y ADIV+E GY+ + + G + + Y+ A+ + R + 

Sbjct: 132 GAVAVNTLNDG YVFNPAD I VS EDANG YWR - HGDHFHYI PKAS LSQQKQVQASRAVSRLG 190 

Query: 155 RQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPX 214 

Q H + N + + Y T++G++FN I T +V H H H+I 

Sbjct: 191 NQSNSHYRVNSSKIAGL HYPTSNGFLFNGRGI KGTTPTGI LVEHHNHLHF I S F 243 

Query: 215 XXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKE 274 

+ S A Q A P++K ++ L + 

Sbjct: 244 ADLRKG GWGSIADRYQPQKKADSKKQSPSSKKPRTENTLPK 284 



http://us.expasy.org/cgi-bin/blast.pl 



9/27/04 



ExPASy BLAST2 Interface Page 82 of 1 1 6 



Query: 275 LYDSPSAQRYSESDGLVFDPAKI - ISRTPN GVAIPHGDHYHFI 316 

A ELD + 4-1 + 4-T N G PH DH H I 
Sbjct: 285 D I KDKLAYLARE LHLDISRIRVLKTLNGEIGFEYPHDDHTHVI 327 



tr Q8E5R2 Hypothetical protein gbs0918 [gbs0918] [Streptococcus 877 
agalactiae AA 
(serotype III)] align 

Score = 109 bits (273) , Expect = 2e-22 

Identities = 110/406 (27%), Positives = 169/406 (41%), Gaps = 59/406 (14%) 

Query: 250 TQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDG-LVFDPAKIISRTPNGVAIP 308 

T+ VAK K ANK + + + +" DG L+ +KI + S +G+ + 

Sbjct: 36 TKQVAK KKANKKKVSVKESHKKRKGVAGVDFPTDDGFLLTKDSKILSHPDSGI WA 91 

Query: 309 HGDHYHFIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXK 368 

HG+H HFI YS L K 4- 4-P + NAK N+ 
Sbjct: 92 HGNHSHFIFYSDLKG- -SKFSYLIP NANAKTNKKQAVRNFKAGAVA 135 

Query: 369 ELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPI 42 8 

4-4-4- 4-DGY+FNP DIV E A Y+VRHGDHFHYI PK+ + Q + A+ + S 

Sbjct: 136 - VNTLNDGYVFNPADI VSEDANGYWRHGDHFHYI PKASLSQQKQVQ ASRAVSRLG 190 

Query: 429 NPGISHEK -HEEDGYGFDANRIIAEDESGFIMSHGNHNHYFFKKDLTEEQ 477 

N SH + DG+ F + I +G ++ H NH H+ DL + 

Sbjct: 191 NQNNSHYRVNSSKIAGLHHPTSDGFLFNGQGIKGTTPTGILVEHHNHLHFISFADLRKGG 250 

Query: 478 IKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESI 537 

+ + K + S SS + + LK I+4-K+A + +4- + I 

Sbjct: 251 WGS IADRYQPQKKVDSKKQSPSSKK PRTENTLPKDIKDKLAYLARELHLDISRI 304 

Query: 538 WNKEKNAII YPHGDHHH ADPIDEHKPVGIGH SHSNYELFKPEEGV 583 

V K N 1 YPH DH H A ID KP+ H H 4- E E 

Sbjct: 305 RVLKTLNGEIGFEYPHDDHTHVIMAKDIDLSKPIPNPHHDDEDHHKGHHHDESDHKHEEH 364 

Query: 5 84 AKKEGNKVYTGEELTNVWLLKNSTFN-NQNFTLANGQKRVSFSFP 62 8 

+ NK+ + E+ ++ L + N NQ L + + F +P 

Sbjct: 365 EHTKSNKL - SDEDQKKLIYLAEKLGLNPNQIEVLTSEDGS I IFKYP 409 

Score = 100 bits (248) , Expect = 2e-19 

Identities = 110/471 (23%), Positives = 186/471 (39%), Gaps = 98/471 (20%) 

Query: 139 KDAAHADWRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNP-ADIIEDT 197 

K++ H + + +K+ VK++ K VA 4- T4-DG4-+ + 1 + 

Sbjct: 29 KNSHH I KTKQVAKKKANKKKVS VKESHKKRKGVAGVD FPTDDGFLLTKDSKILSHP 84 

Query: 198 GNAYIVPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSY SSTASDNNTQSVA 254 

+ +V HGH H+I +++ S+ SY ++ A N Q+V 

Sbjct: 85 DSGIWAHGNHSHFI F YS D L KG S KF S YL I PN ANAKTNKKQ AVR 127 

Query: 255 KGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYH 314 

A + N DG VF4-PA I + S NG + HGDH+H 

Sbjct: 12 8 NFKAGAVAVNTLN DGYVFNPADIVSEDANGYWRHGDHFH 167 
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Query: 315 FIPYSKLSALEEKIARRVPISGTGSTVSTNAKPNEWXXXXXXXXXXXXXXXXKELSSAS 374 

+ IP + LS ++ A R +S G+ +++ + N S 
Sbjct: 168 Y I PKAS LSQQKQVQASRA- VSRLGNQNNSHYRVNS S KI AGLHH PTS 212 

Query: 3 75 DGYIFNPKDIVEETATAYIVRHGDHFHYI PKSNQIGQPTLPN 416 

DG++FN + I T T +V H +H H+I P+ + P+ 

Sbjct: 213 DGFLFNGQGIKGTTPTGILVEHHNHLHFISFADLRKGGWGSIADRYQPQKKVDSKKQSPS 272 

Query : 417 NSLATPSPSLP - - INPGISHEKHEEDGYGFDANRI IAEDESGFIMSHGNHNHYFFK 47 0 

+ +LP I +++ E D +RI E GF H +H H 

Sbjct: 273 SKKPRTENTLPKD I KDKLAYLARE LHLD I S R I R VLKTLNGE I GFE YPHDDHTHV I MA 329 

Query: 471 KDLT-EEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKIEEKIAGIMKQ 52 9 

KD+ +1 E+ H+ +S HE + + ++ D D+K K+ + ++ 
Sbjct: 330 KDIDLSKPIPNPHHDDEDHHKGHHHDESDHKHEEHEHTKSNKLSDEDQK KLIYLAEK 386 

Query: 53 0 YGVKRES I - WNKEKNAI I - - YPHGDHHH ADPIDEHKPVGIG-HSHSM 573 

G+ I V+ E +11 YPH DH H +1+ KP+ G H HS + 

Sbjct: 387 LGLNPNQIEVLTSEDGSIIFKYPHDDHSHTIASKDIEIGKPIPDGHHDHSH 437 



Score =48.9 bits (115), Expect = 5e-04 

Identities = 68/345 (19%), Positives = 120/345 (34%), Gaps = 82/345 (23%) 

Query: 21 CAYALNQHRSQENKDNNRVSY VDGSQSSQKSENLTPDQVSQKEG- - IQ 66 

C + AL +QE+K+ + + + V +S +K -f + +G + 

Sbjct : 16 CHFALTACHTQEHKNSHHIKTKQVAKKKANKKKVSVKESHKKRKGVAGVDFPTDDGFLLT 75 

Query: 67 AEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALFSEELLMKDPN 110 

4- +4- D G V +HG+H H+ + Y L + PN 

Sbjct: 76 KDSKILSHPDSGIWAHGNHSHF IFYSDLKGSKFSYLIPNANAKTNKKQAVRNFKA 131 

Query: 111 YQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNV RTKDEIN 154 

Y ADIV+E GY+++ G ++ Y+ A+ 4- R + 

Sbjct: 132 GAVAVNTLNDGYVFNPADIVSEDANGYWR-HGDHFHYIPKASLSQQKQVQASRAVSRLG 190 

Query: 155 RQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPX 214 

Q H + N + + 4- T4-DG+ + FN I T 4-V H H H+I 

Sbjct: 191 NQNNSHYRVNSSKIAGL -HHPTSDGFLFNGQGIKGTTPTGILVEHHNHLHFI - - 241 

Query: 215 XXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKE 2 74 

QP +■ S S ++ + 4- +P+4-LL+E 
Sbjct : 242 S FADLRKGGWGS I ADRYQPQKKVDS KKQS PS SKK - - PRTENTLPKD I KDKLAYLARE 296 

Query: 2 75 LYDSPSAQRYSESDGLVFDPAKIISRTPN GVAI PHGDHYHF I 316 

L+ S R + +T N G PH DH H I 

Sbjct: 297 LHLDISRIR VLKTLNGEIGFEYPHDDHTHVI 327 



tr Q8P 0G5 Putative internalin A [spyM18_1373] [Streptococcus pyogenes 792 AA 
(serotype M18) ] 

align 



Score = 99.4 bits (246), Expect = 3e-19 

Identities = 93/350 (26%), Positives = 143/350 (40%), Gaps = 72/350 (20%) 
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Query: 237 SQLSYSSTASDNN TQSVAKGSTS KPANKSENLQSLLKELYDS PSAQRYSES 287 

SQL+ + S N T+ KG TS KP KS+ + K + + + 

Sbjct : 16 SQLTL I ACQSRGNGTYP I KTKQSRKGMTSNKI KP I KKS KKINKTHKGV AGVDFPTD 71 

Query: 288 DGLVFDP-AKIISRTPNGVAIPHGDHYHFIPYSKL- -SALEEKIARRVPISGTGSTVSTN 344 

DG + +KI+S+T G + + H H HFI Y+ L S E I + + 
Sbjct: 72 DGF I LTKDS KI LS KTDQG I WDHDGHSHF I F YADLKGS PFEYL I PKGASL 121 

Query: 345 AKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIP 404 

AKP + + + y FNP DIV E A Y VRH DHFHYI 

Sbjct: 122 AKP AVAQQAASQGTS KVADPHHHYEFNPAD I VAEDALGYTVRHDDHFHY I L 172 

Query: 405 KSNQIGQPTLPNNSLAT- -PSPSLPIN PGISHEKHEEDGYGFDANRIIAEDES 455 

KS+ GQ +AT P S P+ + PG+ DG+ F+ 1+ ++ 

Sbjct: 173 KSSLSGQTQAQAKQVATRLPQTSSPVSTVTANGIPGLHFP- -TSDGFQFNGQGIVGVTKA 23 0 

Query: 456 GFIMSHGNHNHYFFKKDLTE EQ I KAAQKHLEE VKTSHNGLDS LS SHEQD YPGN 508 

+ + +H H H DL + +Q A+K + +T H 
Sbjct: 231 SILVNHDGHLHPISFADLRQGGWAHVADQYDPAKKAEKPAETHH 274 

Query: 509 AKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI I YPHGDHHH 555 

E+ + + K+ +EK+A + ++ G+ +1 K + YPH DH H 

Sbjct: 275 TPELSEREKEYQEKLAYLAEKLGIDPSTIKRVETQDGKLGLEYPHHDHAH 324 

Score = 51.6 bits (122), Expect = 8e-05 

Identities = 73/360 (20%), Positives = 135/360 (37%), Gaps = 68/360 (18%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCA YALNQHRSQENKDNNRVSYVDGSQSSQKS - 51 

MK K I G + L+L A Y + + + + S+ K+ 

Sbjct : 1 MKTKKVT I LVGLLLS SQLTL I ACQSRGNGTYP I KTKQSRKGMTSNKI KP I KKS KKINKTH 60 

Query: 52 ENLTPDQVSQKEG- - IQAEQIVIKITDQGYVTSHGDHYHY YNGKVPYDAL 99 

+ + +G + + ++ TDQG V H H H4- + +P A 

Sbjct : 61 KGVAGVDFPTDDGFILTKDSKILSKTDQGIWDHDGHSHFIFYADLKGSPFEYLIPKGAS 120 

Query: 100 FSEELLMK DPN- - YQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADN 146 

+ + + + DP-H Y+ ADIV E GY ++ D ++ LK + 

Sbjct: 121 L AKP AVAQQAAS QGTS KVADPHHHYE FNP AD I VAEDALGYTVRHDDHFHY ILKSSLSG-- 178 

Query: 147 VRTKDE INRQKQEHVKDNEKVNSNVAVARS QG RYTTNDGYVFNPADI IEDTGNAYI 202 

+ Q + +S V+ + G + T+DG+ FN 1+ T + + 

Sbjct: 179 QTQAQAKQVATRLPQTSSPVSTVTANGIPGLHFPTSDGFQFNGQGIVGVTKASIL 233 

Query: 203 VPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPA 262 

V H GH H I P++ + A ++T +++ + 
Sbjct: 234 VNHDGHLHPI SFADLRQGGWAHVADQYDPAKKA-EKPAETHHTPELSE REK 283 

Query: 263 NKSENLQSLLKELYDSPSA-QRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKL 321 

ELL ++L PS +R DG 4- G+ PH DH H + S + 

Sbjct: 284 E YQEKLAYLAEKLG IDPST I KRVETQDGKL GLEYPHHDHAHVLMLSDI 331 



tr Q8K714 Putative internalin A [inlA] [Streptococcus pyogenes (serotype 792 
M3)] AA 
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align 



Score = 97.4 bits (241), Expect = le-18 

Identities = 93/350 (26%), Positives = 140/350 (39%), Gaps = 72/350 (20%) 

Query: 2 37 SQLSYSSTASDNN ; TQSVAKGSTS KPANKSENLQSLLKELYDSPSAQRYSES 2 87 

SQL + + S N T+ KG TS KP KS+- K + + + 
Sbj ct : 16 SQLTLIACQSRGNGTYPIKTKQSRKGMTSNKIKPIKKSKKTNKTHKGV- AGVDFPTD 71 

Query: 288 DGLVFDP - AKI ISRTPNGVAIPHGDHYHFIPYSKL - - SALEEKIARRVPISGTGSTVSTN 344 

DG + +KI + S+T G+ 4- H H HFI Y+ L S E I + + 
Sbjct: 72 DGFILTKDSKILSKTDQGIWDHDGHSHFIFYADLKGSPFEYLIPKGASL 121 

Query: 345 AKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIP 404 

AKP +++ Y FNP DIV E A Y VRH DHFHYI 

Sbjct: 122 AKP AVAQQ AAS QGTS KVAD PHHH YE FNPAD I VAED ALG YT VRHDDHFH Y I L 172 

Query: 405 KSNQIGQPTLPNNSLAT- -PSPSLPIN PGISHEKHEEDGYGFDANRIIAEDES 455 

KS+ GQ +AT P S P++ PG+ DG+ F+ 1+ + 

Sbjct: 173 KSSLSGQTQAQAKQVATRLPQTSSPVSTATANGIPGLHFP- -TSDGFQFNGQGIVGVTKD 230 

Query: 456 GFIMSHGNHNHYFFKKDLTE EQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGN 508 

+ + H H H DL + +Q A+K + +T H 
Sbjct: 231 NT LVDHDGHLHP I S FADLRQGGWAHVADQ YD PAKKAEKPAETHH 274 

Query: 509 AKEMKDLDKKIEEKIAGIMKQYGVKRESI WNKEKNAI I YPHGDHHH 555 

E+ + +K+ +EK+A + +4- G+ +1 K + YPH DH H 

Sbjct: 275 TPELSEREKEYQEKLAYLAEKLGIDPSTIKRVETQDGKLGLEYPHHDHAH 324 



Score = 54.7 bits (130), Expect = 9e-06 

Identities = 74/360 (20%), Positives = 137/360 (37%), Gaps = 68/360 (18%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCA YALNQHRSQENKDNNRVSYVDGSQSSQKS - 51 

MK K I G + L+L A Y + +S + + + S+ + K+ 

Sbjct: 1 MKTKKVI ILVGLLLS S QLTL I ACQSRGNGT YP I KTKQSRKGMTSNKI KP I KKSKKTNKTH 60 

Query: 52 ENLTPDQVSQKEG- - IQAEQIVIKITDQGYVTSHGDHYHY YNGKVPYDAL 99 

+ + +G + + ++ TDQG V H H H+ + +P A 

Sbjct: 61 KGVAGVDFPTDDGFILTKDSKILSKTDQGIVVDHDGHSHFIFYADLKGSPFEYLIPKGAS 120 

Query: 10 0 FSEELLMK DPN- -YQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADN 146 

++ + + DP+ Y+ ADIV E GY D ++ - LK + 

Sbjct: 121 LAKPAVAQQAASQGTSKVADPHHHYEFNPADIVAEDALGYTVRHDDHFHYILKSSLSG-- 178 

Query: 147 VRTKDEINRQKQEHVKDNEKVNSNVAVARSQG R YTTNDG YVFNP AD I I EDTGNAY I 202 

+ Q + + + +S V+ A + G + T+DG+ FN 1+ T + + 

Sbjct: 179 QTQAQAKQVATRLPQTSSPVSTATANGIPGLHFPTSDGFQFNGQGIVGVTKDNIL 233 

Query: 203 VPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPA 262 

V H GH H I P++ + A ++T ++ + + 
Sbjct: 234 VDHDGHLHPI S FADLRQGGWAHVADQ YDPAKKA-EKPAETHHTPELSE REK 283 

Query: 263 NKSENLQSLLKEL YDS PS A -QRYSESDGLVFDPAKI ISRTPNGVAIPHGDHYHFIPYSKL 321 

ELL +4-L PS +R DG + G+ PH DH H + S + 

Sbjct: 284 E YQEKLAYLAEKLG IDPS T I KRVETQDGKL GLEYPHHDHAHVLMLSDI 331 
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tr Q 7 9XH7 Putative internalin A [SPs0825] [Streptococcus pyogenes 792 
(serotype AA 
M3)] align 

Score = 97.4 bits (241), Expect = le-18 

Identities = 93/350 (26%), Positives = 140/350 (39%), Gaps = 72/350 (20%) 

Query: 237 SQLSYSSTASDNN TQSVAKGSTS KPANKSENLQSLLKELYDSPSAQRYSES 287 

SQL4- + S N T+ KG TS KP KS+ K + + + 

Sbjct : 16 S QLTL I ACQSRGNGTYP I KTKQSRKGMTSNKI KP I KKSKKTNKTHKGV AGVDFPTD 71 

Query: 288 DGLVFDP-AKIISRTPNGVAIPHGDHYHFIPYSKL- -SALEEKIARRVPISGTGSTVSTN 344 

DG + +KI+S+T G-f + H H HFI Y+ L S E I + 4- 
Sbjct: 72 DGF I LTKDSKI LS KTDQGI WDHDGHSHF I F YADLKGS PFEYL I PKGAS L 121 

Query: 345 AKPNEWXXXXXXXXXXXXXXXXKELSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIP 404 

AKP +++ Y FNP DIV E A Y VRH DHFHYI 

Sbjct: 122 AKP AVAQQAAS QGTS KVAD PHHH YE FNP AD I VAEDALG YT VRHDDHFH Y I L 172 

Query: 405 KSNQIGQPTLPNNSLAT- -PSPSLPIN PGISHEKHEEDGYGFDANRI IAEDES 455 

KS+ GQ +AT P S P++ PG+ DG+ F+ 1+ + 

Sbjct: 173 KSSLSGQTQAQAKQVATRLPQTSSPVSTATANGIPGLHFP- -TSDGFQFNGQGIVGVTKD 230 

Query: 456 GFIMSHGNHNHYFFKKDLTE EQ IKAAQKHLEE VKTSHNGLDS LS SHEQD YPGN 508 

+ 4- H H H DL 4- +Q A+K + +T H 
Sbjct: 231 NI LVDHDGHLHP I S FADLRQGGWAHVADQ YDPAKKAEKPAETHH 274 

Query: 509 AKEMKDLDKKIEEKIAGIMKQYGVKRES I WNKEKNAI I YPHGDHHH 555 

E+ + +K+ +EK+A 4- 4-4- G+ +1 K + YPH DH H 

Sbjct: 275 TPELSEREKEYQEKLAYLAEKLGIDPSTIKRVETQDGKLGLEYPHHDHAH 324 



Score =54.7 bits (130), Expect = 9e-06 

Identities = 74/360 (20%), Positives = 137/360 (37%), Gaps = 68/360 (18%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCA YALNQHRSQENKDNNRVSYVDGSQSSQKS - 51 

MK K I G + L+L A Y + +S + + 4-N++ + S+ + K+ 

Sbjct : 1 MKTKKVIILVGLLLSSQLTLIACQSRGNGTYPIKTKQSRKGMTSNKIKPIKKSKKTNKTH 60 

Query: 52 ENLTPDQVSQKEG- - IQAEQIVIKITDQGYVTSHGDHYHY YNGKVPYDAL 99 

+ + +G + + TDQG V H H H+ 4- 4-P A 

Sbjct : 61 KGVAGVDFPTDDGFILTKDSKILSKTDQGIWDHDGHSHFIFYADLKGSPFEYLIPKGAS 12 0 

Query: 100 FSEELLMK DPN- - YQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADN 146 

+ + + + DP4- Y+ ADIV E GY 4-4- D 4-4- LK + 

Sbjct: 121 L AKP AVAQQAAS QGTS KVAD PHHH YE FNP AD I VAEDALG YT VRHDDHFH Y ILKSSLSG-- 178 

Query: 147 VRTKDE INRQKQEHVKDNEKVNSNVAVARS QG RYTTNDGYVFNPADI IEDTGNAYI 2 02 

+ Q ++ + +S V+ A + G + T+DG4- FN 1+ T + 4- 

Sbjct: 179 QTQAQAKQVATRLPQTSSPVSTATANGIPGLHFPTSDGFQFNGQGIVGVTKDNIL 233 

Query: 2 03 VPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPA 2 62 

V H GH H I P + 4- 4- A +4-T 4-4-+ + 
Sbjct: 234 VDHDGHLHPI SFADLRQGGWAHVADQYDPAKKA-EKPAETHHTPELSE REK 283 

Query: 263 NKS ENLQSLLKEL YDS PS A - QRYS ESDGL VFDPAKI I SRTPNGVAI PHGDHYHFI P YS KL 321 

EL L 4- + L PS +R DG 4- G+ PH DH H + S + 

Sbjct: 284 EYQEKLAYLAEKLGIDPSTIKRVETQDGKL GLEYPHHDHAHVLMLSD I 331 
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tr Q99Z76 Putative internalin A [inlA] [Streptococcus 792 AA 

pyogenes] a lign 

Score = 91.7 bits (226), Expect = 7e-17 

Identities = 89/343 (25%), Positives = 137/343 (38%), Gaps = 58/343 (16%) 

Query: 237 SQLSYSSTASDNN TQSVAKGSTS KPANKSENLQSLLKELYDSPSAQRYSES 287 

SQL+ + S N T+ KG TS KP KS+ K + + + 
Sbjct : 16 SQLTLIACQSRGNGTYPIKTKQSRKGMTSNKIKPIKKSKKTNKTHKGV AGVDFPTD 71 

Query: 288 DGLVFDP-AKIISRTPNGVAIPHGDHYHFIPYSKL- - SALEEKIARRVPISGTGSTVSTN 344 

DG + +KI+S+T G+ + H H HFI Y+ L S E I + + 
Sbjct: 72 DGFILTKDSKILSKTDQGIWDHDGHSHFIFYADLKGSPFEYLIPKGASL 121 

Query: 345 AKPNE WXXXXXXXXXXXXXXXX KE L S S AS DG Y I FNP KD I VE E TAT AY I VRHGDH FH Y I P 404 

AKP + + Y FNP DIV E A Y VRH DHFHYI 

Sbjct: 122 AKP AVAQRAASQGTSKVADPHHHYEFNPADIVAEDALGYTVRHDDHFHYIL 172 

Query: 405 KSNQIGQPTLPNNSLAT- -PSPSLPIN PGISHEKHEEDGYGFDANRIIAEDES 455 

KS+ GQ +AT P S ++ PG+ DG+ F+ 1+ + 

Sbjct: 173 KSSLSGQTQAQAKQVATRLPQTSSLVSTATANGIPGLHFP- -TSDGFQFNGQGIVGVTKD 230 

Query: 456 GFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDL 515 

+ +HHH DL+ +K+ + +H+ E+ + 

Sbjct: 231 S I LVDHDGHLHP I S FADLRQGGWAHVADQ YD PAKKAE KPAE THQ TPELSER 281 

Query: 516 DKKIEEKIAGIMKQYGVKRES I WNKEKNAI I YPHGDHHH 555 

+K+ +EK+A + ++ G+ +1 K + YPH DH H 

Sbjct: 282 EKEYQEKLAYLAEKLGIDPSTIKRVETQDGKLGLEYPHHDHAH 324 



Score =55.1 bits (131), Expect = 7e-06 

Identities = 74/360 (20%), Positives = 137/360 (37%), Gaps = 68/360 (18%) 

Query: 1 MKFSKKYIAAGSAVIVSLSLCA YALNQHRS QENKDNNRVS YVDGS QS S QKS - 51 

MK K I G + L+L A Y + +S + + + S+ + K+ 

Sbjct : 1 MKTKKVIILVGLLLSSQLTLIACQSRGNGTYPIKTKQSRKGMTSNKIKPIKKSKKTNKTH 60 

Query: 52 ENLTPDQVSQKEG- -IQAEQIVIKITDQGYVTSHGDHYHY YNGKVPYDAL 99 

+ + +G + + ++ TDQG V H H H+ + +P A 

Sbjct : 61 KGVAGVDFPTDDGFILTKDSKILSKTDQGIWDHDGHSHFIFYADLKGSPFEYLIPKGAS 120 

Query: 100 FSEELLMK DPN- - YQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADN 146 

+ + + + DP-f Y+ ADIV E GY ++ D ++ LK + 

Sbjct: 121 LAKPAVAQRAASQGTS KVADPHHHYEFNPAD I VAEDALGYTVRHDDHFHYI LKS S LSG - - 178 

Query: 147 VRTKDEINRQKQEHVKDNEKVNSNVAVARSQG R YTTNDG YVFNPAD I IEDTGNAYI 202 

+ Q + +S V+ A + G + T+DG+ FN 1+ T ++ + 

Sbjct: 179 QTQAQAKQVATRLPQTSSLVSTATANGIPGLHFPTSDGFQFNGQGIVGVTKDSIL 233 

Query: 203 VPHGGHYHYIPXXXXXXXXXXXXXXXXXXXNMQPSQLSYSSTASDNNTQSVAKGSTSKPA 262 

V H GH H I p + + + A 4- T + + + + 
Sbjct: 234 VDHDGHLHPI S FADLRQGGWAHVADQ YDPAKKA-E KPAE THQTPELSE REK 283 
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Query: 263 NKSENLQSLLKELYDSPSA-QRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKL 321 

ELL + 4-L PS +R DG + G+ PH DH H + S + 

Sbjct: 284 EYQEKLAYLAEKLGIDPSTIKRVETQDGKL GLEYPHHDHAHVLMLSDI 331 



tr Q6HCJ 0 Cell division protein [ftsK] [Bacillus thuringiensis (subsp. 1338 AA 
konkukian) ] 

align 

Score =46.6 bits (109), Expect = 0.002 

Identities = 52/223 (23%), Positives = 96/223 (42%), Gaps = 23/223 (10%) 

Query: 789 AYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKL 84 8 

A 4- + 4- + E P ++E + + 4-4- SI+ + 4- A+E S ++E+ +TS E + 

Sbjct: 585 APIAEEQSWEEAPWEEQPVAEETSIVEE- -QPAAEETSWEEQPVAEETSWEEQPVA 642 

Query: 849 SETGNSTSNSTLEEVPTVDP VQEKVAKFAESYGMKLENVLFNMDGT- - IELYLPSGE 903 

ET EE P V+ VQ++ K + 4- V+ D +E + 

Sbjct: 643 EETSIVEEQPVAEEAPWEEQPWQKEEPKREKKRHVPFNVVMLKQDRARLMERHASRTN 702 

Que ry : 904 VI KKNMADFTGEAPQGNGENKP S ENGKVS TGTVENQPTENKPADS LPEAP - - NE KPVKPE 961 

+ 4- 4-M4-+ P E KP K VE Q E KP + P EKPV+ 

Sbjct : 703 AMQPSMSERVENKPVHQVEEKPQVEEKPVQQWEPQ-VEEKPMQQWVEPQVEEKPVQQ- 760 

Query: 962 NSTDNGMLNPEGNVGSDPM LDPALEEAPAVDPVQEKLEK 1000 

+ 4- P+ V PM + 4-P 4-EE P V+ +4-E + 

Sbjct: 761 WEPQ - - VEEKPMQQ VVVEPQVEEKPVQQWEPQVEE 795 



S P P45386 Immunoglobulin Al protease precursor (EC 3.4.21.72) (IGA1 1849 

IGA4_HAEIN protease) AA 

[iga] [Haemophilus influenzae] align 

Score = 46.2 bits (108), Expect = 0.003 

Identities = 54/226 (23%), Positives = 88/226 (38%), Gaps = 30/226 (13%) 

Query: 78 8 NAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEK 84 7 

4-A +N+ VE P~f T+ Q + A+ +EE T+ E 

Sbjct: 1022 SAQSNNEEIARVETPVPPPAPATESAIASEQPETRPAET AQPAMEETNTANSTETAP 1078 

Query: 84 8 LSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKK 907 

S+T T N E VP4- E K AE+ 4- E V N E +GEV K+ 

Sbjct: 1079 KSDTATQTENPNSESVPS ETTEKVAENPPQENETVAKNEQEATEPTPQNGEVAKE 1133 

Query: 908 NMADF TGEAPQGNGENKPSENGKVSTGTVENQPT ENKPADSLPEAPNEK 956 

+ T EA Q G+ + ++ T 4-4-4-PT EN+P 4-4- 4-4- +K 

Sbjct: 1134 DQPTVEANTQTNEATQSEGKTEETQ TAETKSEPTESVTVSENQPEKTVSQSTEDK 1188 

Query: 957 PV KPENSTDNGMLNPEGNVGSDP - MLDPALEEAPAVDPVQE 996 

V K + T+ P+ P 4-PA EE P 4-E 

Sbjct: 1189 VWEKEEKAKVETEETQKAPQVTSKEPPKQAEPAPEEVPTDTNAEE 1234 
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tr Q8MNS0 Hypothetical protein F12F3.2 [F12F3.2] [Caenorhabditis 6328 
elegans] AA 

a lign 

Score = 46.2 bits (108), Expect = 0.003 

Identities = 53/260 (20%), Positives = 101/260 (38%), Gaps = 34/260 (13%) 

Query: 7 91 LDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSE 85 0 

L+ Q E+ ++E + + +K AQE S+L+++ 4-+ +K+E E S + 

Sbjct: 1368 LEEQKKKAAEIALIEIQKEQEK LAQEQSRLEDEAKKSAEKQKLESETKSK 1417 

Query : 851 TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNM- 909 

+ +-+E P + K S K ++ +D L V KK + 

Sbjct: 1418 QTEEAPKESVDEKPKKKVLKKKTEKSDSSISQKSKSAKSTVDAAETLESDFNLVEKKTVQ 1477 

Query: 910 ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPV 958 

A + Q E ++G T + + P KP DS EA +K V 

Sbjct: 1478 KVEQSPDESTSATIKRDPAQKTEEISKQDDGDEKKTTTDGKPP- -KPEDS- -EATPKKRV 1533 

Query: 95 9 KPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDG 1018 

+++'++++ L +EE P +++K EK DSVI 
Sbjct: 1534 VKKKTQKSDSVASDASLADVSKLSDDVEEKPKKKVLKKKTEK- SDSVISETSS 1585 

Query: 1019 TIELRLPSGEVIKKNLSDLI 1038 

+ + S E+ + +1 
Sbjct: 1586 VDTIKPESVEIPTEKAEQMI 1605 



tr Q8ISF7 2MDa_l protein [isof] [Caenorhabditis elegans] 18534 AA 

align 

Score = 46.2 bits (108), Expect = 0.003 

Identities = 53/260 (20%), Positives = 101/260 (38%), Gaps = 34/260 (13%) 

Query : 7 91 LDNQSTYIVEVPILEKENQTDKPS ILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSE 85 0 

L+ Q E-f ++E + + +K AQE S+L+ + + ++ +K+E E S + 

Sbjct: 13574 LEEQKKKAAEIALIEIQKEQEK LAQEQSRLEDEAKKSAEKQKLESETKSK 13623 

Query : 851 TGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNM- 909 

+++E P K S K ++ +D L V KK + 

Sbjct: 13624 QTEEAPKESVDEKPKKKVLKKKTEKSDSSISQKSKSAKSTVDAAETLESDFNLVEKKTVQ 13683 

Query: 910 ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPV 958 

A + Q E ++G T + +P KP DS EA +K V 

Sbjct: 13684 KVEQS PDESTS AT I KRDPAQKTEE I S KQDDGDEKKTTTDGKPP - -KPEDS - -EATPKKRV 13739 

Query : 959 KPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDG 1018 

+++++++ L +EE P +++K EK DSVI 
Sbjct: 13740 VKKKTQ KS D S VAS DAS L AD VS KL S DD VE E KP KKKVL KKKTE K SDSVISETSS 13791 

Query: 1019 TIELRLPSGEVIKKNLSDLI 103 8 

+ + S E+ + +1 
Sbjct: 13792 VDTIKPESVEIPTEKAEQMI 13811 
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tr Q8ISF6 2MDa_2 protein [isof] [Caenorhabditis elegans] 18519 AA 

align 

Score =46.2 bits (108), Expect = 0.003 

Identities = 53/260 (20%), Positives = 101/260 (38%), Gaps = 34/260 (13%) 

Query: 7 91 LDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSE 850 

Ii+ Q E + ++E + + +K AQE S+L+ + + ++ +K+E E S + 

Sbjct: 13 5 74 LEEQKKKAAEIALIEIQKEQEK LAQEQSRLEDEAKKSAEKQKLESETKSK 13 62 3 

Query : 851 TGNSTSNSTLEEVPTVT5PVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNM- 909 

+ ++E P + + + K K S K 4-4- +D L V KK + 

Sbjct: 13624 QTEEAPKESVDEKPKKKVLKKKTEKSDSSISQKSKSAKSTVDAAETLESDFNLVEKKTVQ 13683 

Query: 910 ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPV 958 

A + Q E ++G T + +P KP DS EA +K V 

Sbjct: 13684 KVEQSPDESTSATIKRDPAQKTEEISKQDDGDEKKTTTDGKPP- -KPEDS- -EATPKKRV 13739 

Query : 95 9 KPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDG 1018 

+ + 4- + + ++ L +EE P + EK DSV1 
Sbjct: 13740 VKKKTQKSDSVASDASLADVSKLSDDVEEKPKKKVLKKKTEK SDSVISETSS 13791 

Query: 1019 TIELRLPSGEVIKKNLSDLI 103 8 

+ + S E+ + +1 
Sbjct: 13792 VDTIKPESVEIPTEKAEQMI 13811 



sp P16053 Neurofilament triplet M protein (160 kDa neurofilament 857 

NFM_CHICK protein) AA 

(Neurofilament medium polypeptide) (NF-M) [NEFM] [Gallus align 
gallus (Chicken) ] 

Score = 44.7 bits (104), Expect = 0.009 

Identities = 58/236 (24%) , Positives = 85/236 (35%) , Gaps = 39/236 (16%) 

Query: 760 GE I KL P I PKLNQGTTRTAGNKI PVTFMANAYLDNQST Y I VE VP I L EKENQTDKPS 814 

G I PI Q + A KI T + L Q ++ E+ EK D S 

Sbjct: 411 GSITGPIFTHRQPSVTIASTKIQKTKIEPPKLKVQHKFVEEIIEETKVEDEKSEMEDALS 470 

Query: 815 ILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVXIPVQEKVA 874 

+ + KAQE + 4-EK EE E-f EK +E EE +E+ A 

Sbjct: 471 AIAEEMAAKAQEEEQEEEKAEEEAVEEEAVSEKAAEQAAEEEEKEEEEAE EEEAA 525 

Query: 875 K- -FAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVS 932 

K AE G K E + +G EA + E K GK 
Sbjct: 526 KSDAAEEGGSKKEEIEEKEEGE EAEEEEAEAK GKAE 561 

Query: 93 3 TGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEA 98 8 

+ + ++ PA S P++P + PV + + VG D + A E+A 

Sbjct: 562 E AG AKVE KVKS P P AKS P P KS P P KS P VTE Q AKAVQKAAAEVGKDQKAEKAAEKA 614 



tr Q97QP7 Immunoglobulin Al protease [SP1154] [Streptococcus 2004 
pneumoniae] AA 

align 
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Score = 44 . 
Identities 

Query: 800 

Sbjct: 512 

Query: 8 60 

Sbjct: 572 

Query: 919 

Sbjct: 615 

Query: 97 0 

Sbjct: 675 

Query: 1027 

Sbjct: 725 



3 bits (103), Expect = 0.012 

= 61/251 (24%) , Positives = 99/251 (39%) , Gaps = 46/251 (18%) 

EVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNST 85 9 
EP+ ET+SI +ES++ PTSK ES + 

ETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVG 571 

LEEVPTVCPVQEK^AKFAESYGMKLENVLFN-MDGTIELYLPSGEVIKKNMADFTGEAPQ 918 

P + + E+ +E V N +GT+E E P 

ESNKPEHNDSKN ENS E KT VEE VP VNPNEGT VE GTSNQETEKPV 614 

GNGENKPSENGKV STGTVENQPTENKP - - ADSLPEAPNEKPVKPENS TDNGML 9 69 

E + +GK+ 4-TG V N+PH- ++KP +S N KPENS ++NG 

QPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEKNGTATKPENSGNTTSENGQT 674 

NPE GNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNMDGTIELRLPS 102 6 

PE GN D + + + + + + + E L D + + T+ELR 
EPEPSNGNSTEDVSTESNTSNSNGNEEIKQENE LDPDKKVEEPEKTLELR 724 

GEVIKKNLSDL 103 7 

N+SDL 
NVSDL 72 9 



tr Q8IB63 Hypothetical protein PF080035 [PF08_0035] [Plasmodium 1176 
falciparum AA 
(isolate 3D7)] align 

Score =44.3 bits (103), Expect = 0.012 

Identities = 56/237 (23%), Positives = 94/237 (39%), Gaps = 27/237 (11%) 

Query: 77 8 GNKIPVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEP 83 7 

GNK N D T E +E E +K +1+ Q Q+N KLDE + + E 

Sbjct : 2 05 GNKKNKNKNKNKNSDKSETNDEEEVTVEME EKENIIEQLNE QQNEKLDEQLNE - 257 

Query : 83 8 KTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIEL 8 97 

+ +E + E + +E + N +E P + ++ F E N F + 

Sbjct : 258 QPNENFD - EQPNENFDEQPNENFDEQPNENFDEQPNENFDEQ PNENFDEQPNENF 311 

Query : 898 YLPSGEVIKKNMADFTGEAPQGNGENKPSEN-GKVSTGTVENQPTEN KPADSLPEAP 953 

E + + E P N + +P+EN + + QP EN +P ++ E P 

Sbjct : 312 DEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQP 371 

Query: 954 NEKPVKPENSTDNGMLNPEGNVGSDPMLDPA LEEAPAVDPVQEKLEK 1000 

ME + N+T + N V ++ ++D +EE D + E+ EK 

Sbjct: 372 NENSDEQSNNTNKDEPNTSNYVNTENVIDQGKGNNSECGNIVEEQNNGDSLDERKEK 428 



Score = 40.0 bits (92), Expect = 0.23 

Identities = 43/193 (22%), Positives = 76/193 (39%), Gaps = 9/193 (4%) 

Query: 8 05 EKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVP 864 

+K+ + K +NK + + K + EE T E EKE + E N N L+E 

Sbjct: 196 QKKKKKKKKGNKKNKNKNKNKNSDKSETNDEEEVTVEMEEKENIIEQLNEQQNEKLDEQL 255 

Query: 865 TVI)PVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENK 924 
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P + + E + + + NF+ E + + EPN+ + 

Sbjct: 256 NEQPNENFDEQPNENFD - EQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQ 314 

Query: 925 PSEN-GKVSTGTVENQPTEN KPADSLPEAPN EKPVKPENSTDNGMLNPEGNVG 976 

P+EN + + QP EN +P + + E PN E+P + + N + + N 

Sbjct: 315 PNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNENFDEQPNEN 3 74 

Query: 977 SDPMLDPALEEAP 98 9 

SD + ++ P 
Sbjct: 375 SDEQSNNTNKDEP 3 87 



tr Q869E1 Similar to Xenopus laevis (African clawed frog) . DNA ligase I 1192 
(EC AA 
6.5,1.1) (Polydeoxyribonucleotide synthase align 
[Dictyostelium discoideum (Slime mold)] 

Score =44.3 bits (103), Expect = 0.012 

Identities = 46/176 (26%) , Positives = 76/176 (43%) , Gaps = 12/176 (6%) 

Query: 820 KRNKAQENSKLD EKVEEPKTSEKVEKEKLSETGNSTSNSTL EEVPTVDPVQE 871 

K + AQE +KL +K E EKVEK++ ET +S EE + E 

Sbjct : 23 KISNAQELNKLKTSPKKKREAWKEKVEKKEKKETKPKRKSSKKNKEEEEEEEQEEQDGE 82 

Query: 872 KVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKV 931 

+ + E Y + E + +++G E+ L E K + + ENK S++ 

Sbjct : 83 EEQEEEEEYQQQDEEIEEDINGEEEMELDENEKEKNKKKKQSLKTK ENKESKSSSS 13 8 

Query: 93 2 STGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEE 987 

S T+EN-f T+ s ++ N K +K + D+ + N D LD L+ + 

Sbjct: 13 9 SKKTIENKETKKPEKQSSKQSNNLKRLKRKKMDDDEEDEEDENKTDDNDLDDMLDD 194 



tr Q9VC00 CG13648-PA [CG13648] [Drosophila melanogaster (Fruit fly)] 2768 AA 

alig n 

Score =43.9 bits (102), Expect = 0.016 

Identities = 57/218 (26%) , Positives = 89/218 (40%) , Gaps = 29/218 (13%) 

Query : 773 TTRTAGNKI PVTFMANAYLDNQSTYI VEVP I - - LEKENQTDKPS ILPQFKRNKAQENSKL 83 0 

T AG KIP + + ++ v P +EKE T+ P+ P ENS 

Sbjct: 1651 TVAPAGEKIPTSSITPDEEPTATSAPVAKPDEDVEKETSTEIPTDAPA- -SSEEDENSST 1708 

Query : 831 DEKVEE PKTSEKVEKEKLSETGNSTSNSTLEEVPT VDPVQEKVAKFAES YGMKLE 8 85 

E P+T + +E G + 4- +T +EVP V + E+V AE E 

Sbjct: 1709 DQIPSEVPEKKPETPAQTPEEG-DIVGATAAPTTSDEVPPVQRLPEEV- -LAEIPQPSTE 1765 

Query: 886 NVLFNMDGTIELYLPSGEVIKKNMADFTGEA PQGNGENKPSENGKVSTGTVENQPT 941 

+ D T PS+4-++EA P + KP+E E +P 

Sbjct: 1766 TGIKQQDETTAA- -PSIDRKEPYVTEIDEEATTVAPISEKDEKPTE EEKPV 1814 

Query : 942 ENKPADSLP - - EAPNEKPVKPENSTDNGMLNPEGNVGS 977 

E KP P E EKP++ + ST+ + GS 

Sbjct: 1815 EQKPTGEEPSEEEEKEKPIEQDVSTEGPVSTEASEAGS 1852 
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tr Q83 9C3 N-acetylmuramoyl -L- alanine amidase, family 4 [EF0252] 503 
[Enterococcus AA 
faecalis (Streptococcus faecalis)] align 

Score = 43.5 bits (101), Expect = 0.021 

Identities = 41/184 (22%) , Positives = 68/184 (36%) , Gaps = 9/184 (4%) 

Query : 82 2 NKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAES YG 8 81 

N ENS + E E T+E E SET +T ST E + + + + E+ G 

Sbjct : 3 0 NLVAENSSVAETTAEATTAEATETTATSETTEATEESTTETESSTESSESATTESTETSG 8 9 

Query: 8 82 MKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPT 941 

+ + + T + E ++TE+ + +E+ T T + 

Sbj ct : 90 TETTDSTTDSTST STTESTTDSTSTSTTESTTDSTSTSTTESSTTPTTTPSSSKE 144 

Query: 94 2 ENKPADSLPEAPN-EKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 10 00 

+ KP S E + KPV P + + p + ++PE +EE 
Sbjct : 145 QPKPGTSTSESKQPAKPVTPTAPAEKPVEQPAASTPQPEIVPPVTNETVG LVEDDET 201 

Query: 1001 FTAS 1004 

FT S 

Sbjct: 2 02 FTVS 2 05 



tr 0^3793 Neurofilament medium subunit [Serinus canaria 4 87 AA 

(Canary) ] alig n 

Score =43.1 bits (100), Expect = 0.027 

Identities = 57/236 (24%), Positives = 85/236 (35%), Gaps = 39/236 (16%) 

Query: 760 GEIKLPIPKLNQGTTRTAGNKIPVTFMANAYLDNQSTYIVEVPIL EKENQTDKPS 814 

G I PI Q + A KI T + L Q E+ EK D S 

Sbjct: 41 GSITGPIFTHRQPSVTIASTKIQKTKIEPPKLKVQHKFVEEIIEETKVEDEKSEMEDALS 100 

Query: 815 ILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVA 874 

+ + KAQE 4- -f EK EE E+ EK +E EE -+-E+ A 

Sbjct: 101 AIAEEMAAKAQEEEQEEEKAEEEAVEEEAVSEKAAEQAAEEEEKEEEEAE EEEAA 155 

Query : 875 K- -FAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVS 932 

K AE G -f E + +G EA + E K GK 
Sbjct: 156 KSDAAEEGGSEKEEIEEKEEGE EAEEEEAEAK GKAE 191 

Query: 933 TGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEA 988 

+ + +-f PA S P+ + P + PV + 4- VG D + A E+A 

Sbjct: 192 EAGAKVEKVKSPPAKSPPKSPPKSPVTEQ AKAVQKAAAE VG KD Q KAE KAAE KA 244 



tr Q9 FN97 Transposon protein-like [Arabidopsis thaliana (Mouse-ear 1089 
cress)] AA 

align 
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Score =43.1 bits (100), Expect = 0.027 

Identities = 31/131 (23%) , Positives = 56/131 (42%) , Gaps = 2/131 (1%) 

Query : 19 SLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQG 78 

+ C Y ++H +N+ +R Y G+Q + PD SQ ++ I + D + 

Sbjct: 792 TFCTYYFDRHIKTKNRAGDR-HYDGGNQEDTHEFDGVPDIFSQAGRDSGKESEIWLQDKD 850 

Query: 79 YVTSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVK-GGYIIKVDGKYYVY 137 

Y +H + P++ LF E L+ +P KD + + E + ++ K D Y + 

Sbjct: 851 YHIAHRYILRNCDQLRPFERLFDESLIAANPGISEKDLNELREKQYSSWLKKYDNSYPEW 910 

Query: 13 8 LKDAAHADNVR 14 8 

L H V+ 
Sbjct: 911 LLS I VHGPMVK 921 



tr Q963 T1 Glutamate- rich protein (Fragment) [GLURP] [Plasmodium 107 8 

reichenowi] AA 

ali gn 

Score =43.1 bits (100), Expect = 0.027 

Identities = 48/182 (26%) , Positives = 82/182 (44%) , Gaps = 21/182 (11%) 

Query: 791 L.DNQSTYIVE VPILEKENQTDKPSILPQFKRN-KAQENSKLDEKVEEPKTSEKVEK 845 

LDN+ +1 E V +L+ EN + + PQ K N + + E +D ++ P+ EK E 
Sbjct: 360 LDNEKEHIDESQHNV1WLQ-ENNINNHQLEPQEKANIESFEPKNIDSEIILPENVEKEEI 418 

Query : 846 EKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVI 905 

+ ++ TLE + + +E V+ + N T+E + 

Sbjct: 419 WD VPS PKHFNHETLEQETSESEHEEAVSEK NAHETVEHEEAVSQES 465 

Query: 906 KKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTD 965 

AD G Q N N+ +EN V + E++P EN+ + SL EA E+ V +N+ + 
Sbjct: 466 NPEKADNDGMVSQ-NSNNELNENEFVESEKSEHEPAENEES - SLEEAHQEEIVPEQNNQE 523 

Query: 966 NG 967 

+G 

Sbjct: 524 SG 525 



tr Q0 7594 K2 protein (Fragment) [Entamoeba histolytica] 24 2 AA 

ali gn 

Score =43.1 bits (100), Expect = 0.027 

Identities = 50/203 (24%), Positives = 80/203 (38%), Gaps = 32/203 (15%) 

Query: 803 ILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEE 862 

IL+ + + IF +N+A EK+EE K E+ EK S TG+ N++ E 

Sbjct: 14 ILDIDQEVKDTKIYSVFLKNEASP EKLEENKEDEEKEKSSSSNTGDEPDNNS - -E 66 

Query: 863 VPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGE 922 

+ D E A + K+E S + + + +D P+ + 

Sbjct: 67 AKSDD EPEASSNDKSDDKVE ASSSDKPEASSSDKPDNKPEASSS 110 

Query: 923 NKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPV KPE-NSTDNGMLNPEGNVG 976 
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+KP + + +NKP +S + P+ KP KPE +S+D PE + 

Sbjct: 111 DKPEAS S SDKPEAS S SDKPDNKPEES S SDKPDNKPE AS S SDKPEAS S SDKPDNKPEAS S S 170 

Query: 977 SDPMLDPALEEAPAVDPVQEKLE 999 

P P EA + D K E 

Sbjct: 171 DKPDNKP EASS SDKPDNKPE 190 



tr Q6 PK21 OGFR protein [Homo sapiens (Human)] 513 AA 

align 

Score =43.1 bits (100), Expect = 0.027 

Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 8/100 (8%) 

Query: 900 PSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 95 9 

P+G+ ++ G P G ++P+E+ + G PT -f + PA+S E P +P 

Sbjct: 385 PAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPSPAGPTRDEPAESPSETPGPRPAG 444 

Query: 960 P ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDP 993 

P E+ + + P G G +P P+ E P P 

Sbjct: 445 PAGDEPAESPSETPGPRPAGPAGDEPAESPS - -ETPGPSP 482 



Score = 35.4 bits (80), Expect = 5.7 

Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 6/82 (7%) 

Query: 914 GEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP ENSTDNG 967 

G +P G + + P+E+- 4- G P + + PA+S E P P P E+ + + 

Sbjct: 379 GPSPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPSPAGPTRDEPAESPSETP 438 

Query: 968 MLNPEGNVGSDPMLDPALEEAP 989 

P G G +P P+ P 
Sbjct: 43 9 GPRPAGPAGDEPAESPSETPGP 4 60 



sp_vs Q9NZT2-2 Splice isoform 2 of Q9NZT2 [OGFR] [Homo sapiens (Human)] 657 AA 

ali gn 

Score = 43.1 bits (100), Expect = 0.027 

Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 8/100 (8%) 

Query: 900 PSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 959 

P+G+ ++ G P G ++P+E+ + G PT ++PA+S E P +P 

Sbjct: 529 PAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPSPAGPTRDEPAESPSETPGPRPAG 588 

Query: 960 P ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDP 993 

P E+ ++ P G G +P P+ E P P 

Sbjct: 58 9 PAGDEPAESPSETPGPRPAGPAGDEPAESPS- -ETPGPSP 62 6 

Score = 35.4 bits (80), Expect = 5.7 

Identities = 21/82 (25%), Positives = 35/82 (42%), Gaps = 6/82 (7%) 
Query: 914 GEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKP ENSTDNG 967 
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G +P G ++P+E+ + G P ++PA+S E P P P E+ ++ 

Sbjct: 523 GPSPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPSPAGPTRDEPAESPSETP 582 

Query: 968 MLNPEGNVGSDPMLDPALEEAP 989 

P G G +P P+ P 
Sbjct: 583 GPRPAGPAGDEPAESPSETPGP 604 



S P Q9NZT2 Opioid growth factor receptor (OGFr) (Zeta-type opioid 677 

OGFR_HUMAN receptor) AA 

(7-60 protein) [OGFR] [Homo sapiens (Human)] align 

Score =42.7 bits (99), Expect = 0.036 

Identities = 26/100 (26%), Positives = 45/100 (45%), Gaps = 8/100 (8%) 

Query: 900 PSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 959 

P+G+ ++ ++ G P G ++P+E + G PT 4- + PA+S E P -fP 

Sbjct: 549 PAGDEPAESPSETPGPRPAGPAGDEPAETPSETPGPSPAGPTRDEPAESPSETPGPRPAG 608 

Query: 960 P ENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDP 993 

P E+ ++ P G G +P P+ E P P 

Sbjct: 609 PAGDEPAESPSETPGPRPAGPAGDEPAESPS- -ETPGPSP 646 



Score = 37.4 bits (85), Expect = 1.5 

Identities = 22/96 (22%) , Positives = 42/96 (42%) , Gaps = 6/96 (6%) 

Query: 900 PSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVK 959 

p + G+ G P G ++P+E+ + G P ++PA++ E P P 

Sbjct: 529 PAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPRPAGPAGDEPAETPSETPGPSPAG 588 

Query: 96 0 P ENSTDNGMLNPEGNVGSDPMLDPALEEAP 98 9 

P E+ ~s- + P G G +P P+ P 

Sbjct: 589 PTRDEPAESPSETPGPRPAGPAGDEPAESPSETPGP 624 



tr Q6HBX5 Hypothetical protein [BT9727_4640] [Bacillus thuringiensis 3471 
(subsp. AA 
konkukian) ] align 

Score =42.7 bits (99), Expect = 0.036 

Identities = 52/208 (25%) , Positives = 84/208 (40%) , Gaps = 28/208 (13%) 

Query: 808 NQTDKP - - S ILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPT 865 

NQ + + P SI PQ+K + ++ K EPK E+VE + N+ N L EV 

Sbjct: 3132 NQPEEPWSITPQYKDGMLKAEKEVSNK- -EPKLGEEVEYRISFK- -NTVENGKLAEVKI 3187 

Query: 866 VDPVQEKVAKFAESYGMK LENVLFNMDGTIELYLPSGEVIKKNMA 910 

D + + + +S K EN++ +1 + +V +K 
Sbjct: 3188 EDQLPDGLEYVKDSVKAKGAIEVKVENGKLTAKYENIIDTKERNITFKV KVKEKAGE 3244 

Query : 911 DFTGEAPQGNGENKPSE -NGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGML 969 

+ A +G N+P E + E+P+K+PEPE VKPE+ + + 

Sbjct: 3245 EIVNRAIVDDGINQPLEPTVSIKPKEPEVKPEDPKEPEVKPEDPKEPEVKPEDPKE- PEV 3303 
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Query: 97 0 NPEGNVGSDPMLDPALEEAPAVDPVQEK 997 

PE 4~P + P 4- P V P K 

Sbjct: 3 3 04 KPED - - PKEPEVKPEDPKEPEVKPEDPK 3 32 9 



tr Q7RQS8 Retinitis pigmentosa GTPase regulator- like protein [PY01014] 674 AA 
[Plasmodium yoelii yoelii] 

a lign 

Score =42.7 bits (99), Expect = 0.036 

Identities = 45/193 (23%), Positives = 82/193 (42%), Gaps = 16/193 (8%) 

Query: 771 QGTTRTAGNKIPVTFMANAYL-DNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSK 82 9 

4- T 4- +P T N+ 4-N4-S + + P + ++P 1+ + + + N 

Sbjct: 431 ENPTSLSNTNVPRTGEGNSIQQENRSPNVADRPPTKTSGVENRPMII -ELEVSSDASNFI 489 

Query: 83 0 LDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLF 889 

LD E + E+ E+E+ E 4- LE++ + +E + E + +4- E + 
Sbjct: 490 LDNDETEEEHDEEPEEEEAEEETDENLQDILEDIEEGED-EEDMEDDDENETLEEES 545 

Query: 890 NMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSL 94 9 

+ SG + N+ D EA GE EN + 4- T EN TE+ A 
Sbjct : 546 NEVTRETSSGGDERPNVTD EASPTEGETTAQENNSDNSRTNENSTTESTNA 596 

Query: 95 0 PEAPNEKPVKPEN 962 

E N+KP KP+N 
Sbjct: 597 - EQKNKKPPKPQN 608 



tr 077320 Hypothetical protein MAL3P3.3 [MAL3P3.3] [Plasmodium 3724 
falciparum AA 
(isolate 3D7)] align 

Score =42.7 bits (99), Expect = 0.036 

Identities = 37/172 (21%), Positives = 76/172 (43%), Gaps = 19/172 (11%) 

Query : 804 LEKENQTDKPS ILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEV 863 

4-EK+N +K + 4- K4-N 4-4- 4- 4-4-4-K 4- K 4-+ +K + N 4- S 4-E 
Sbjct : 10 99 MEKKNDMEKKNDIE- -KKNDMEKKNDMEKKNDMEKKNDMEKKNDMENENNMENKSDIEN- 1155 

Query: 864 PTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGEN 92 3 

+ K+ 4-4- 4- MK EN 4- 4- TIE 4- 4-1+ NM 4- EN 
Sbjct: 1156 ENKMNE YKNN - - MKNENKM IE- NNT I ENNM I ENNM I ENNM I E NNMIEN 1200 

Query: 924 KPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNV 975 

EN -f 4-EN EN 4- N+ +K E+ N +N 4- N+ 

Sbjct: 1201 NM I ENNM I ENNM I ENNM I ENNTKI AYNMNENKNNI KI EDDAKN - Q INVQNN I 1251 



tr Q9L4 Z1 Pyruvate dehydrogenase complex subunit E2 [pdhC] 433 
[Staphylococcus AA 
epidermidis] align 
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Score = 42.4 bits (98), Expect = 0.047 

Identities = 38/176 (21%) , Positives = 71/176 (39%) , Gaps = 17/176 (9%) 

Query: 782 PVTFR^AYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSE 841 

PV+ L++TV ++K+D + QFK + E + SK +EK +E E 

Sbjct : 51 PVSGTVEEVLVDEGTVAWGDVIVKIDAPDAEEM--QFKGHGDDEDSKKEEKEQESPVQE 108 

Query: 842 KVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPS 901 

+ + E + T++ 4-P+ V K+A G-f ++ V N G + 
Sbjct: 109 E AS S TQS QE KTE VDE S KT VKAMP S VRKYARENGVNIKAV- -NGSG KN 153 

Query: 902 GEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKP 957 

G + + + + GN++S+V T+P PE + P 

Sbjct: 154 GRITKEDIDAYLNGGSSEEGSNTSAASESTSSDVWASATQALPEGDFPETTEKIP 209 



tr Q9VGW4 CG14692-PA [CG14692] [Drosophila melanogaster (Fruit fly)] 2762 AA 

align 

Score = 42.4 bits (98), Expect = 0.047 

Identities = 49/217 (22%) , Positives = 88/217 (39%) , Gaps = 12/217 (5%) 

Query: 7 94 QSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGN 853 

+ ST + E P EKE T Li + ++ + ++ LDEK K S + + + + S T 

Sbjct: 1131 KSTSLDEKPSSEKEKSTS LDETPSSEKENSTSLDEKPSPEKESTSLDEKPSSGTEK 1186 

Query: 854 STS NSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKN 90 8 

STS +S E+ ++D + + S+ +N +EPE + 

Sbjct: 1187 STSLDEKSSSEKEKSTSLDEKPSSEKEKSTSLNERPSSEKENSTSLVENPSPEKESTSLD 124 6 

Query: 90 9 MADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGM 968 

+G + + PS + ST E +E++SEP++K++ 

Sbjct: 1247 EKPSSGTEKSTSLDENPSSEKEKSTSLNERPSSEKENSTSQDEKPSSETEKSTSLDEKPS 1306 

Query: 969 LNPEGNVGSDPMLDPALEEAPAVD - - PVQEKLEKFTA 1003 

E + D E + + 4-+D P EK EK T+ 

Sbjct: 13 07 SEKEKSTSLDGKPSSEKEKSTSLDENPSSEK-EKSTS 1342 



Score = 41.2 bits (95), Expect = 0.10 

Identities = 56/211 (26%) , Positives = 91/211 (42%) , Gaps = 38/211 (18%) 

Query: 794 QSTYIVEVPILEKENQT DKPSILPQFKRNKAQENSKLDEKVE -EPKTSEKVEKEKLS 849 

+ ST + E P EKEN T +KPS ++++ LDEK E + S ++ + S 

Sbjct: 1269 KSTSLNERPSSEKENSTSQDEKPS SETEKSTSLDEKPSSEKEKSTSLDGKPSS 1321 

Query : 850 ETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNM 909 

■ E STS L+E P + + +EK E + EN T + PS E + + 

Sbjct: 1322 EKEKSTS LDENPSSE - -KEKSTSLNERPSSEKEN STSLVENPSPEKESTSL 1370 

Query : 910 ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGML 969 

+ KPS + ST EN +E + + SL E P+ + ENST 

Sbjct: 1371 DEKPSSGTEKSTSLDENPSSEKEKSTSLNERPSS EKENSTSQDE- 1414 

Query: 970 NPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 1000 
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P 4-LD + + V 4-K4-+4- 

Sbjct: 1415 KPSSEKEKSTLLDKNTDLMRDLIQVSQKVDE 144 5 



Score = 37.7 bits (86), Expect = 1.2 

Identities = 41/172 (23%) , Positives = 75/172 (42%) , Gaps = 20/172 (11%) 

Query: 8 04 LEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTS 8 56 

LE + D+ ++L 4-4- + ++ LDE K S + 4-44 S T STS 

Sbjct: 1068 LEDAGENDRSTLLENLPSSEKENSTSLDENPLPEKESTSLDEKPSSGTEKSTSLDEKSSS 1127 

Query: 857 NSTLEEVPTVI)PVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADF 912 

4-44-L4-E P+ + 4-EK E + 4- EN ++D E P E 4- 
Sbjct: 1128 EKEKSTSLDEKPSSE- -KEKSTSLDETPSSEKENST-SLD EKPSPEKESTSLDEKPS 1181 

Query: 913 TGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENST 964 

+G + + K S 4- ST E +E + + SL E P+ + ENST 

Sbjct: 1182 SGTEKSTSLDEKSSSEKEKSTSLDEKPSSEKEKSTSLNERPSS EKENST 1230 



Score = 36.6 bits (83), Expect = 2.6 

Identities = 47/183 (25%) , Positives = 76/183 (40%) , Gaps = 23/183 (12%) 

Query: 794 QSTYIVEVPILEKENQT DKPSILPQ FKRNKAQENS KLDE KVEE P KTS 840 

4-ST + E P EKEN T 4-KPS + R+ Q + K+DE++ 4 K + 

Sbjct: 1394 KSTSLNERPSSEKENSTSQDEKPSSEKEKSTLLDKNTDLMRDLIQVSQKVDEEMSKGKAA 1453 

Query : 841 EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLP 90 0 

V G S NS LE+ P+ 4- + + A+ -f K 4N+ P 

Sbjct: 1454 IAWDLPDINKGESV-NSPLEKKPSSQELEDIQAELSTD KETGEPYNLSAEKHKVEP 1509 

Query: 901 SGEVIKKNMADFTGEAPQG-NGENKPSENGKVSTGTVENQPTENKPADSLPEAP-NEKPV 95 8 

IK + T G G 4- PS4 GK T 44 + K D+L P ++ P 

Sbjct: 1510 ISKKTIETTDWDVGLKGNDDPSKPGKTETAATKSVSPDIKDKDTLKLKPDDDSPQ 1565 

Query: 959 KPE 961 

K E 

Sbjct: 1566 KSE 1568 



Score = 35.8 bits (81), Expect = 4.4 

Identities = 42/180 (23%), Positives = 69/180 (38%), Gaps = 15/180 (8%) 

Query: 82 6 ENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLE 8 85 

EN KL.4- ++ + KT 4- 4-EK S T 4 4- 44- 4-4 K ES LE 
Sbjct: 1023 ENMKLEGEIGQLKTVFEQSEEKTSPTKSESLHAEDRKIS GKSKES LE 1069 

Query : 8 8 6 NVLFNMDGTIELYLPSGEVIKKNMADFTG - -EAPQGNGENKPSENGKVSTGTVENQPTEN 94 3 

4- N T4- LPS E D E +4 KPS 4 ST E 4-E 

Sbjct: 1070 DAGENDRSTLLENLPSSEKENSTSLDENPLPEKESTSLDEKPSSGTEKSTSLDEKSSSEK 1129 

Query : 944 KPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEKFTA 1003 

+ + SLEP4 4-K4-4 E + D PE + EKT4- 

Sbjct: 113 0 EKSTSLDEKPSSEKEKSTSLDETPSSEKENSTSLDEKPSPEKESTSLDEKPSSGTEKSTS 118 9 
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tr Q90307 Carassius auratus [Carassius auratus (Goldfish) ] 798 AA 

align 

Score =42.0 bits (97), Expect = 0.061 

Identities = 42/171 (24%), Positives = 63/171 (36%), Gaps = 9/171 (5%) 

Query: 831 DEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFN 890 

+E+ +E E+ E+ K E S EE T +E AE G + + 

Sbjct: 540 EEEKQEDDAEEEAEETKAPEAKASPETEKAEEKQTSGGEEE AEEEGDEKDEADAG 594 

Query: 891 MD- -GTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADS 948 

D E + E KK A+ T E +G E + + K E +P KP 

Sbjct: 595 SDKGSPGEKEPEAKEQPKKVEAEATKEDKKGKAEKEEPKPEKEKVAVTETKPETAKPESP 654 

Query: 94 9 LPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLE 999 

E + P E P + E +P +P 4-EAP +P +K E 

Sbjct: 655 KSESPKEPP- -KTEAPKKEAPKTESPTKEEPKAEPPKKEAPKSEPEDKKKE 703 



tr Q898B0 Hypothetical protein CTC00555 [CTC00555] [Clostridium tetani] 324 AA 

align 

Score =42.0 bits (97), Expect = 0.061 

Identities = 49/168 (29%), Positives = 68/168 (40%), Gaps = 31/168 (18%) 

Query: 821 RNKAQENSKL-DEKVEEPKTSEKVEKEKLSETGNSTSNSTLE EVPTVDPVQEKVAK 875 

+ K Q N + + K E PK S+ EK KLSE L ' +P+ P V 

Sbjct: 161 KEKTQPNKHIKNPKKENPKPSQGKEKPKLSEEQMLAKAENLWGGKINLPSYIPKGYDVTN 220 

Query: 876 FA- -ESYGMKLENVLF NMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGK 930 

+YG K+ + + N +EL + G+ K D GNG P EN K 

Sbjct: 221 IGMDTAYGSKVLKITYKNPNSGNFLELKVLEGD - - KTAFED GNGSKNPKENEK 271 

Query: 931 VSTGTVENQPTEN- -KPADSLPEAPNEKPVKPENSTDNGM-LNPEGNV 975 

+ + P EN KP +4- VK NS NG+ N EGN+ 

Sbjct: 272 NEDDNRQDNPKENVDKPTET VKSVNSNKNGVEYNIEGNI 310 



tr Q6R4Z8 Dehydrin cor29 [Capsella bursa-pastoris (Shepherd's purse)] 261 AA 

align 

Score =42.0 bits (97), Expect = 0.061 

Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 18/113 (15%) 

Query: 792 DNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVE 844 

+N+ T + E+ +E++ +KPS++ + R+ + +S DE+ EE K + VE 

Sbjct : 8 0 ENKITLLEELQEKTEEDEENKPSVIEKLHRSNSSSSSSSDEEGEEKKKKKTVEGEEEKKG 13 9 

Query: 845 KEKLSETGNSTSNSTLEEVPTVD PVQEKVAKFAESYGMKLENVL 888 

KEKL G+ + +VP V PV E V 4- E+ G + + V+ 

Sbjct: 140 AMDKIKEKL--PGHHDKETEDHDVPWSTIQVPVSESWEHHETEGEEKKGVM 190 
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tr Q8 MMQ1 Similar to Required for the transfer of mannosylphosphate to 
cell 

wall mannans . ; Mnn4p [Dictyostelium discoideum (Slime 
mold)] 

Score = 41.6 bits (96), Expect = 0.080 

Identities = 40/171 (23%), Positives = 76/171 (44%), Gaps = 15/171 (8%) 

Query: 805 EKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNST S 856 

E+-E + +K P+ K + ++K +E+ E+ + EKVE+E+ SET +T + 
Sbjct: 205 EEEEEEEKVEEKPKKVIKKQKVSNKTEEQKEKVEEEEKVEEEEQSETTTTTQKKKKFISN 264 

Query: 857 NSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKN MADFT 913 

+ST++ + + QEK A + K L + 4- +E V++K + 

Sbjct: 265 DSTIDIMKQISEEQEKKANQKITQEKKKTQQLQSKE- -VEKQTKKKNVVEKKKTILKKLK 322 

Query: 914 GE APQGNGENKP S ENGKVS TGT VENQPTENKPAD S L PE APNEKP VKPENS T 964 

GEQ + K E V+T + + + E P+E + P + 

Sbjct: 323 GEEEQ- -EQEKEEEKPWAKTTTTTKKSVSKKPKTQVEQPSEEQTTPSKKS 371 



393 
AA 

align 



tr Q8IBL1 Hypothetical protein MAL7P1.129 [MAL7P1 . 129] [Plasmodium 1003 
falciparum AA 
(isolate 3D7)] a lign 

Score = 41.6 bits (96), Expect = 0.080 

Identities = 53/233 (22%), Positives = 87/233 (36%), Gaps = 32/233 (13%) 

Query: 402 YIPKSNQIGQPTLPNNSLATPSPSLPINPGISHEKHEEDGYGFDANRII - -AEDESGFIM 459 

Y P + +LP+N S S +NP +K +N I EDE 

Sbjct: 338 YSPNKTKNNNSSLPDNKKTNKSNSY-WPTQKKDKINNSNTTWSNNISHEKEDEKRKKS 396 

Query: 460 SHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKI 519 

H + N KK++ + K + + + K +NG + ++H N +KK+ 
Sbjct: 397 DHRDLNETE - KKNIKLDYPKDFESNNKHDKHINNGDEEKNAHS SHSQNNK EKKL 449 

Query: 520 EEKIAGIMKQYGVKRES IWNKE KNAIIYPHGDHHHADPIDEH 562 

+++ +KR ++ K+ K 1+ + H++ I+EH 

Sbjct: 450 HSNDTEALRRSSIKRSTVDYEKKNLKSESYYGKEYLNSYSKEDIMKEGKEKKHSNDINEH 509 

Query: 563 KPVGIGHSHSN- - YELFKPEEGVAKKEGNKVYTGEELTNWNLLKNSTFNNQN 613 

+ SHSN E K E+ KK+ N NV K T NN N 

Sbjct: 510 NNIEGSLSHSNRLRES IKFED KKKNNNS NN Y S H YNNVHG AF KKHTHNNDN 559 



S P Q 8CT13 Dihydrolipoyl lysine- residue acetyltransf erase component of 433 

ODP2_STAEP pyruvate AA 

dehydrogenase complex (EC 2.3.1.12) (E2) align 
(Dihydrolipo amide acetyltransf erase component of 
pyruvate dehydrogenase complex) [pdhC] [Staphylococcus 
epidermidis] 

Score = 41.2 bits (95), Expect = 0.10 

Identities = 38/176 (21%), Positives =70/176 (39%), Gaps = 17/176 (9%) 
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Query: 782 PVTFMANAYLDNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSE 841 

PV+ L-f+TV + + K + D + QFK + E + SK +EK +E E 

Sbjct : 51 PVSGTVEEVLVDEGTVAWGDVIVKIDAPDAEEM- -QFKGHGDDEDSKKEEKEQESPVQE 108 

Query: 842 KVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPS 901 

+ + E + T++ +P+ V K+A Q+ ++ V N G + 
Sbjct: 109 EASSTQSQEKTEVDESKTVKAMPS VRKYARENGVNIKAV- -NGSG KN 153 

Query: 902 GE VI KKNMADFTGE APQGNGENKP S ENGKVS TGT VENQPTENKPAD S L PE APNE KP 957 

G + K+ + + + G N + S-f V T+ P PE + P 

Sbjct: 154 GRITKEDIDAYLNGGSSEEGSNTSVASESTSSDWNASATQALPEGDFPETTEKIP 209 



tr 033741 SpsA protein [Streptococcus pneumoniae] 53 9 AA 

align 

Score = 41.2 bits (95), Expect = 0.10 

Identities = 48/197 (24%), Positives = 82/197 (41%), Gaps = 21/197 (10%) 

Query: 794 QSTYIVEVPILEKENQTDKPS ILP QFKRNKAQENSKLDE KVEEPKTSEK 842 

++ Y+ E+ +LE++++ + PS + +FK++ + K+ E KVEE K + 

Sbjct: 97 KTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAE 156 

Query: 843 VEKEKLSETGNSTSNSTLE-EVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPS 901 

+KE+ + + TLE E+ D K AE +K E F +GTI+ + 
Sbjct: 157 DQKEEDRRNYPTNTYKTLELE I AE FDVK VKEAELELVKEEAKEFRNEGTIK QA 209 

Query: 902 GE V I KKNMADFTGEAPQGNGENKPS ENGKVS TGT VENQPTENKPADSLPE APNE KPVKPE 961 

E++A+T KEK E + KPA+ AP +P KP 

Sbjct: 210 KEKVESKKAEATRLENIKTDRKKAEEEAKRKAA- -EEDKVKEKPAEQPQPAPATQPEKPA 267 

Query: 962 NSTDNGMLNPEGNVGSD 978 

+ P+ D 

Sbjct: 268 PKPEKPAEQPKAEKTDD 284 



tr Q8 I1W5 Hypothetical protein PFD0320c (Fragment) [PFD0320c] 2228 
[Plasmodium AA 
falciparum (isolate 3D7)] align 

Score = 41.2 bits (95), Expect = 0.10 

Identities = 37/161 (22%), Positives = 60/161 (36%), Gaps = 6/161 (3%) 

Query : 805 EKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVP 864 

EKEN T + + + K N +E K+ E+ E+ EK + + E + EE 

Sbjct: 1370 EKENMTAEKENMTEEKENMTEEKEKMTEEKEQVTEKEKEKMTEEKENMTEKKENMTEEKE 1429 

Query : 865 TVD P VQE KVAKF AES YGMKLENVL FNMDGT I E L YLPS GE V I KKNMAD FTGE APQGNGENK 924 

+ +E + + + E +E+ + E E I + TEQ E + 

Sbjct: 1430 QITEEKEQITEEKEQMTEEKEQITEEKEQITE EKEQITEEKEQITEEKEQITEEKE 1485 

Query: 925 PS ENGKVS TGTVENQPTENKPADSLPEAPNEKPVKPENSTD 965 

K + Q TE K + + E K EN TD 

Sbjct: 14 86 QITEEKEQITEEKEQITEEK- -EQITEEKENMTEKKENMTD 1524 
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tr Q0 7593 K2 protein (Fragment) [Entamoeba histolytica] 23 8 AA 

align 

Score = 41.2 bits (95), Expect = 0.10 

Identities = 46/205 (22%), Positives = 79/205 (38%), Gaps = 18/205 (8%) 

Query: 80 3 ILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNST 85 9 

IL+ + + IF +N+A EK+EE K E+ EK S TG+ N++ 

Sbjct: 14 ILDIDQEVKDTKIYSVFLKNEASP EKLEENKEDEEKEKS S S SNTGDEPDNNS EAK 68 

Query: 860 LEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQG 919 

+ + P + KSKE ++ S + + + P + 

Sbjct : 69 SDDEPEASSNDKSDDKVEASSSDKPE AS S SDKPE AS S SDKPDNKPEAS S SDKPEE 123 

Query : 92 0 NGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDP 97 9 

+ +KP + + +NKP +S + P+ KP + S+D PE + P 

Sbjct : 124 S S SDKPEAS S SDKPEAS S SDKPDNKPEES S SDKPDNKP - - E AS S SDKPDNKPEAS S SDKP 181 

Query: 980 MLDPALEEAPAVDPVQEKLEKFTAS 1004 

P EA + D K E +A+ 

Sbjct: 182 DNKP EASS SDKPDNKPEAS SAT 2 03 



S P Q28820 Triadin [TRDN] [Oryctolagus cuniculus 7 05 AA 

TRDN_RABIT (Rabbit)] align 

Score =40.8 bits (94), Expect = 0.14 

Identities = 42/186 (22%), Positives = 68/186 (35%), Gaps = 12/186 (6%) 

Query: 806 KENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPT 865 

K+ ++P Q K-f K +E +K +K E SEK K K+ + + ST + VP 

Sbjct: 384 KKPAEEQPKGKKQEKKEKHEEPAKSTKK-EHAAPSEKQAKAKIERKEEVSAASTKKAVPA 442 

Query: 866 VDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKP 925 

+EK KEK GILE+K+ +EK 

Sbjct: 443 KK- -EEKTTKTVEQETRK EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETKK 494 

Query: 926 S ENGKVS TGT VENQ PTENKP AD S LPE APNE KP VKPENS TDNGMLNPEGNVGS D PMLD PAL 985 

E ++ E KP + P + +K E + ++ PE + L 

Sbjct: 495 DEKTSKPEPQIKK EEKPGKEVKPKPPQPQIKKEEKPEQDIMKPEKTALHGKPEEKVL 551 

Query: 986 EEAPAV 991 

4-+ AV 
Sbjct: 552 KQVKAV 557 

Score = 35.8 bits (81), Expect = 4.4 

Identities = 22/82 (26%), Positives = 40/82 (47%), Gaps = 2/82 (2%) 

Query: 799 VEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 858 

+ ++ KE + +K + + K K + + EK E+P+T ++EK + T 
Sbjct: 158 IPTKWHKEKEKEKEKVKEKEKPEKKATHKEKIjEKKEKPETKTVTKEE 217 

Query: 85 9 TLEEVPTV- -DPVQEKVAKFAE 878 

T +EV V + V++ VAK E 
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Sbjct: 218 TKKEVKGVKQEKVKQTVAKAKE 23 9 



tr Q7SXW9 Hypothetical protein (Fragment) [Brachydanio rerio (Zebrafish) 232 AA 
(Danio rerio) ] 

align 

Score = 40.8 bits (94), Expect = 0.14 

Identities = 37/134 (27%), Positives = 55/134 (40%), Gaps = 18/134 (13%) 

Query: 82 6 ENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLE 8 85 

+N DE E K +E K E G++ +N P E A+ G 

Sbjct: 12 KNGAKDETAAE - KPAEAANKSNGQENGHAKTNGN AS PNAEAAAEDVQANGKHAA 64 

Query: 8 86 NVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKP 94 5 

DG + + E K AD AP+G GE + NG+ ST T E+ T n~+P 

Sbjct: 65 DGEVK AEEGKAEEADAEKAAPEGEGESSAVANGEDSTKTEESAATSSEP 113 

Query: 94 6 ADSLPEAPNEKPVK 95 9 

A + +KP K 

Sbjct: 114 AKTKKRFSFKKPFK 127 



tr Q8DPR5 Immunoglobulin Al protease (EC 3.4.24.13) [iga] 1963 
[Streptococcus AA 
pneumoniae (strain ATCC BAA-255 / R6)] align 

Score =40.8 bits (94), Expect = 0.14 

Identities = 45/183 (24%) , Positives = 74/183 (39%) , Gaps = 27/183 (14%) 

Query: 800 EVPILEKENQTDKPSILP-QFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 858 

E P+ E T+ SI + A+E + + EKV +SE + + + ++TS 

Sbjct: 512 ETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSENTGEVSSNPSDSTTSVG 571 

Query: 85 9 TLEEVPTVTJPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQ 918 

+ D E K E + 4-GT+E E P 

Sbjct: 572 ESNKPEHNDSKNENSEKTVEEVPVNPN EGTVE GTSNQETEKPV 614 

Query: 919 GNGENKP S ENGKVS TGTVENQPTENKP - - ADSLPEAPNEKPVKPENS TDNGML 969 

E + +GK++ TG V N+P+++KP +S N KPENS + +NG 

Sbjct: 615 QPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEKNGTATKPENSGNTTSENGQT 674 

Query: 970 NPE 972 

PE 

Sbjct: 675 EPE 677 



tr Q59 947 Immunoglobulin Al protease (EC 3.4.24.13) [iga] 1964 
[Streptococcus AA 
pneumoniae] align 

Score = 40.8 bits (94), Expect = 0.14 

Identities = 45/183 (24%), Positives = 74/183 (39%), Gaps = 27/183 (14%) 
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Query: 800 EVPILEKENQTDKPSILP-QFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 858 

E P + E T+ SI + A+E + + EKV + SE + + + ++TS 

Sbjct: 512 ETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSENTGEVSSNPSDSTTSVG 571 

Query: 859 TLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQ 918 

+ D E K E + +GT+E + + E P 

Sbjct: 572 ESNKPEHNDSKNENSEKTVEEVPVNPN EGTVE GTSNQETEKPV 614 

Query: 919 GNGENKP S ENGKVS TGTVENQPTENKP - - ADSLPEAPNEKPVKPENS TDNGML 96 9 

E + +GK++ TG V N+P+ + + KP +S N KPENS ++NG 

Sbjct: 615 QPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEKNGTATKPENSGNTTSENGQT 674 

Query: 970 NPE 972 
PE 

Sbjct: 675 EPE 677 



tr Q28688 Neurofi lament -H (Fragment) [Oryctolagus cuniculus (Rabbit)] 511 AA 

align 

Score = 40.8 bits (94), Expect = 0.14 

Identities = 54/233 (23%), Positives = 92/233 (39%), Gaps = 18/233 (7%) 

Query: 7 80 KI PVTFMANAYLDNQSTY I VE VP ILEKENQTDKPS I LPQFKRNKAQENS KLDEKVEEPKT 83 9 

K PV A + +S + P+ E+ +K P + E +K EK + P 

Sbjct : 275 KS P VKEE AKS PEKAKS PEKAKS PVKEE AKS PEKAKS - PVKEEAKS PEKAKS PEKAKS PVK 333 

Query : 840 SEKVEKEKLSETGNSTSNSTLEEVPTVD P VQE KVAKF AE S YGMKLENV - L FNMDGT 894 

E EK +++ + EE + + PV+E+ AK E K E V + 

Sbjct: 334 EEAKS PEK - AKS PEKAKS PVKEEAKS PEKAKS PVKEE - AKS PEKETPKKEE VKVKEPPKK 391 

Query : 8 95 I EL YLPSGEV I KKNMADFTGEAPQGNGENKPS ENGKVS TGTVENQPTENKP ADS LPEAPN 954 

+E P+ ++K+ EAP+ E K ST + E + K + + 

Sbjct : 3 92 VEETAPAPPKVEKDSKK- -DEAPKKEAPKPAVEKPKESTAEAKKDEAEDKKTAAPAKVEG 44 9 

Query: 955 EKPVKPENSTDNGMLNPEGNVGSDPML DPALEEAPAVDPVQEKLEK 1000 

+ + KP+ T+ PE +P P EE PA PV++ + + 

Sbjct: 450 KE E AKP KE KTE VAKKE PED AKAKE P S KPT E KE P E KP KKE E T P AA - P VKKNAKE 501 



tr Q94674 Thrombospondin- related anonymous protein (Fragment) [TRAP] 614 AA 
[Plasmodium gallinaceum] 

align 

Score =40.8 bits (94), Expect = 0.14 

Identities = 44/209 (21%), Positives = 84/209 (40%), Gaps = 28/209 (13%) 

Query: 792 DNQSTYIVEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSET 851 

D+ IE E +KP P+K++ KEVEKEV +EK E + 

Sbjct: 299 DEEPEPIPEEKKPEPVPEEKKPESAPEEKNPESVPEEKKPESVPEEKEPESVPEEKEPES 358 

Query : 852 GNSTSNSTLEEVPTVDP VQE KVAKF AESYGMKLENVL FNMDGT I EL YLPSGEV I KKN 908 

++ E+ P DP E G K+E + P E 
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Sbjct: 359 VPEEKEPESAPEEKKPESDPEEKKLEPIPE - -GKKIEPI PEEEKLEPI 404 

Query: 90 9 MADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDN- - 966 

+ E+ + E++P +G+ + +N P + + + PN+ + + P+N D+ 

Sbjct: 405 PEEKKPESVTEDRESEPVPDGE-AENVPQNIPDDEQEEKISGDIPNDEELIPKNEPDDIK 463 

Query: 967 GMLNPEGNVGSDPMLDPALEE 987 

+ + P+ + + +P EE 

Sbjct: 464 RNEYDTTPNIIPPKDTYNDNEITNPISEE 492 



tr Q6BLN0 Similar to ca | CA2433 | IPF12959 Candida albicans IPF12959 782 
unknown AA 
function [DEHA0F13211g] [Debaryomyces hansenii CBS767] align 

Score =40.8 bits (94); Expect = 0.14 

Identities = 40/213 (18%), Positives = 90/213 (41%), Gaps = 10/213 (4%) 

Query: 800 EVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNST 85 9 

EV L E T-t- P + + ++ + QE ++++EE +S+ +S + 
Sbjct: 315 EVKRLPGEVDTELPIVKQELEQKRIQE ERESKSNQIEETAETTAVSQNDDSNEENK 370 

Query : 860 LEEVPTVDPVQEKVAKFAESYGMKLE NVLFNMDGTIELYLPSGEVIKKNMA- DFTGE 915 

E+ + P ++ ++ +E + E + N + + + N+ + + T + 

Sbjct : 371 NIEIDNITPPSKESSELSEQVDVSKEVTGESISNEEVETQTITEDDKNANPNISREATIK 430 

Query : 916 APQGNGENKPSENGKVSTGTVENQPTENKPADSLPEA- PN- EKPVKPENSTDNGMLNPEG 973 

+ N+ SEN +V ++N + DS + + PN + P+K E S+++ ++N 
Sbjct : 431 ENTASNSNQASENDEVIDNDLDNNDEKLTENDSVSNSKPNVDAPIKEEQSSNDDLINESS 4 90 

Query: 974 NVGSDPMLDPALEEAPAVDPVQEKLEKFTASYG 1006 

++ D E A+D + E + G 
Sbjct: 491 KEDTNKPKDDDGETQAAIDS IVSDEENLLNNKG 523 



S P_ VS Q28820- Splice isoform Cardiac 3 of Q28820 [TRDN] [Oryctolagus 668 
4 cuniculus AA 

(Rabbit) ] align 

Score =40.8 bits (94), Expect = 0.14 

Identities = 42/186 (22%), Positives = 68/186 (35%), Gaps = 12/186 (6%) 

Query: 8 06 KENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPT 8 65 

K+ ++P Q K+ K +E +K +K E SEK K K+ ++ ST + VP 

Sbjct: 3 84 KKPAEEQPKGKKQEKKEKHEEPAKSTKK-EHAAPSEKQAKAKIERKEEVSAASTKKAVPA 442 

Query: 866 VD P VQE KVAKF AE S YGMKLENVL FNMDGT I EL YL PS GE V I KKNMAD FTGEAPQGNGENKP 925 

+EK KEK GILE+K+ +EK 

Sbjct: 443 KK- -EEKTTKTVEQETRK EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETKK 494 

Query: 92 6 SENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPAL 985 

E ++ E KP + P + +K E + + + PE + L 

Sbjct: 495 DEKTSKPEPQIKK EEKPGKE VKPKPPQPQ I KKEEKPEQD IMKPEKTALHGKPEE KVL 551 

Query: 986 EEAPAV 991 
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+ + AV 
Sbjct: 552 KQVKAV 557 



Score =35.8 bits (81), Expect = 4.4 

Identities = 22/82 (26%), Positives = 40/82 (47%), Gaps = 2/82 (2%) 

Query: 799 VEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 858 

+ ++ KE + +K + + K K + + EK E+P+T ++EK + T 

Sbjct: 158 IPTKWHKEKEKEKEKVKEKEKPEKKATHKEKLEKKEKPETKTVTKEEKKARTKEKIEEK 217 

Query: 859 TLEEVPTV- -DPVQEKVAKFAE 87 8 

T +EV V + V++ VAK E 
Sbjct: 218 TKKEVKGVKQEKVKQTVAKAKE 23 9 



sp_vs Q2882Q- Splice isoform Skeletal 3 of Q28820 [TRDN] [Oryctolagus 698 
6 cuniculus AA 

(Rabbit)] align 

Score = 40.8 bits (94), Expect = 0.14 

Identities = 42/186 (22%) , Positives = 68/186 (35%) , Gaps = 12/186 (6%) 

Query: 8 06 KENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPT 865 

K+ ++P Q K+ K +E +K +K E SEK K K+ ST + VP 

Sbjct: 384 KKPAEEQPKGKKQEKKEKHEEPAKSTKK-EHAAPSEKQAKAKIERKEEVSAASTKKAVPA 442 

Query: 8 66 VDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKP 92 5 

+EK KEK GILE+K+ +EK 

Sbjct: 443 KK--EEKTTKTVEQETRK EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETKK 494 

Query: 92 6 SENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGNLNPEGNVGSDPMLDPAL 985 

E ++ E KP + P + +K E + ++ PE + L 

Sbjct: 495 DEKTSKPEPQIKK EEKPGKEVKPKPPQPQIKKEEKPEQDIMKPEKTALHGKPEEKVL 551 

Query: 986 EEAPAV 991 

+ + AV 
Sbjct: 552 KQVKAV 557 

Score = 35.8 bits (81), Expect = 4.4 

Identities = 22/82 (26%), Positives = 40/82 (47%), Gaps = 2/82 (2%) 

Query: 799 VEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 858 

+ ++ KE + +K + + K K + + EK E+P+T ++EK + T 
Sbjct: 158 IPTKWHKEKEKEKEKVKEKEKPEKKATHKEKLEKKEKPETKTVTKEEKKARTKEKIEEK 217 

Query: 859 TLEEVPTV- -DPVQEKVAKFAE 878 

T +EV V + V~h+ VAK E 
Sbjct: 218 TKKEVKGVKQEKVKQTVAKAKE 23 9 



tr Q54875 IgAl protease [iga] [Streptococcus pneumoniae] 1927 AA 

align 
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Score = 40.4 bits (93), Expect = 0.18 

Identities = 67/265 (25%) , Positives = 109/265 (40%) , Gaps - 43/265 (16%) 

Query : 7 92 DNQSTYIVEV PILEKENQTDK- - PS ILPQFKRNKAQENSK- -LDEKVEEPKTSEKVE 844 

D QS I'VE P L + +DK P + P + + + S+K E 

Sbjct: 409 DVQSGAIVEPAIQPELPEAWSDKGEPEVQPTLPEAWTDKGEPAVQPELPEAWSDKGE 468 

Query : 845 KEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEV 904 

E+ + + N +E+V PV+ K E KEV + TE + E 

Sbjct: 469 PEQVAPLPEYKGN- - IEQVKPETPVE KTKEQGPEKTEEV- -PVKPTEETPVNPNEG 52 0 

Query: 905 I KKNMAD FTGE APQGNGENKP S ENGKVS TGTVENQPTENKP- -ADSLPEAPNEKPVK 95 9 

+ + E P E+ + +GK++ TG V N+P+++KP +S N K 

Sbjct : 521 TTEGTSIQGAENPVQPAEDTQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEKNGTATK 58 0 

Query: 960 PENS TDNGMLNPE GNVGSDPMLDPALEEAPAVDPVQEKLEKFTASYGLGLDSV 1012 

PENS + +NG PE GN D + + ++++ E L D 

Sbjct : 581 PENSGNTTSENGQTEPEPSNGNSTEDVSTKSNTSNSNGNEE I KQENE LDPDKK 633 

Query: 1013 I FNMDGT I ELRLPSGEVI KKNLSDL 1037 

+ + + T+ELR N+SDL 
Sbjct: 634 VEDPEKTLELR NVSDL 649 



tr Q9GUY4 Crustocalcin [DD4 (ccn) ] [Penaeus japonicus (Kuruma 831 AA 

prawn) ] align 

Score = 40.4 bits (93), Expect = 0.18 

Identities = 54/241 (22%), Positives = 90/241 (36%), Gaps = 22/241 (9%) 

Query: 8 05 EKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVP 864 

E E +++ + + + + S +E + EEP+ +EK. +E G EE 
Sbjct : 119 ESEGRSEGTQEEEEEEEEEGESESSGNERSEEPEKEGAEGEEKSAEPG- AEEAE 171 

Query : 8 65 T VDP VQE KVAKFAE S YGMKLENVL FNMDGT I EL YL P S GE V I KKNMADFTGE APQGNG 921 

+ E AK AE+ GE DE GE K + A + 

Sbjct : 172 GEEEAAEPEAKGAEAEGKSAETEKGKSDAEAE - EEAEGEGASKKASEENADVEKAQEVEA 230 

Query : 922 ENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNV 975 

+ + + G E+ PE P PVK E++ E V 

Sbjct : 231 ASSSVDEVAAAEGDASEAAAESTAKQLSPETPALVPVKVESTFAPVSPVTEAPVIATAAM 290 

Query: 976 - - GSDPMLDPALEEAPAVD -PVQEKLEKFTASYGLGLDSVI FNMDGT I - -ELRLPSGEVI 103 0 

+ P++ P E PA P E +AS GLG ++ + + E+ LP+ EV+ 

Sbjct : 291 LPAAAPLVAPVTVETPAAPVPSPEAYSADSASGGLGSQDLVLWIPPVEQEISLPNAEW 350 

Query: 1031 K 1031 

Sbjct: 351 E 351 



tr Q9GTX2 Glutamate-rich protein [GLURP] [Plasmodium 1236 AA 

falciparum] align 

Score = 40.4 bits (93), Expect = 0.18 
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Identities = 49/185 (26%), Positives = 84/185 (44%), Gaps = 27/185 (14%) 

Query: 791 LDNQSTYIVE VPILEKENQTDKPSILPQFKRN-KAQENSKLDEKVEEPKTSEKVEK 845 

LDNQ +1 + + +L+ EN 4- + PQ K N 4-+ E 4-D ++ P E VE 
Sbjct: 361 LDNQKEHIDQSQHNINVLQ-ENNINNHQLEPQEKPNIESFEPKNIDSEIILP ENVET 416 

Query: 846 EKLSETGNSTSNS TLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSG 902 

E4- + + S 4-S T EE + +E V++ N T4-E 

Sbjct: 417 EEIIDDVPSPKHSNHETFEEETSESEHEEAVSEK NAHETVEHEETVS 463 

Query: 903 EVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPEN 962 

+ AD G Q N N+ 4-EN V + E++P EN+ + SL E +E + V +N 

Sbjct: 464 QESNPEKADNDGNVSQ-NSNNELNENEFVESEKSEHEPAENEES-SLEEGHHEEIVPEQN 521 

Query: 963 STDNG 967 

+ ++G 

Sbjct: 522 NEESG 526 



tr Q8IJ56 Glutamate-rich protein [PF10_0344] [Plasmodium falciparum 1233 
(isolate AA 
3D7)] align 

Score = 40.4 bits (93), Expect = 0.18 

Identities = 49/185 (26%), Positives = 84/185 (44%), Gaps = 27/185 (14%) 

Query: 791 LDNQSTYIVE VPILEKENQTDKPSILPQFKRN-KAQENSKLDEKVEEPKTSEKVEK 845 

LDNQ 4-1 + 4- +L+ EN + + PQ K N +4- E +D ++ P E VE 
Sbjct: 361 LDNQKEHIDQSQHNINVLQ-ENNINNHQLEPQEKPNIESFEPKNIDSEIILP ENVET 416 

Query: 846 EKLSETGNSTSNS TLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSG 902 

E++ 4- S +S T EE + +E V+ + N T+E 

Sbjct: 417 EEIIDDVPSPKHSNHETFEEETSESEHEEAVSEK NAHETVEHEETVS 463 

Query: 903 EVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPEN 962 

+ AD G Q N N+ 4-EN V + E+4-P EN4- 4- SL E +E+ V +N 

Sbjct: 464 QESNPEKADNDGNVSQ-NSNNELNENEFVESEKSEHEPAENEES -SLEEGHHEEIVPEQN 521 

Query: 963 STDNG 967 

4- +4-G 

Sbjct: 522 NEESG 526 



tr Q6FWC0 Candida glabrata strain CBS138 chromosome D complete sequence 1148 AA 
[CAGL0D01364g] [Candida glabrata CBS138] 

a lign 

Score = 40.4 bits (93), Expect = 0.18 

Identities = 58/253 (22%), Positives = 107/253 (41%), Gaps = 40/253 (15%) 

Query: 753 LENNYKVGE I KLP I PKL NQGTTRTAGNKI PVTFMAN AY LDNQSTYIVE V 801 

+EN 4- LP + P N T TA + + M +A D + VE + 

Sbjct : 783 VENGQEETTTSLPVPATEETNHNNETVGTAED - LKDEAMHDATQDGEKHQAVEESVTTDL 841 

Query : 802 PILEKENQTDKPSILPQFKRNKAQENSKLDEK-VEEPKTSEKVEKEKLSETGNSTSNSTL 860 
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P E+E D+P+ ++++ +EN+ + + EEP S+ + +K SET +T + 

Sbjct : 842 PSTEQEKHLDEPA QKSEVEENTRAETAGYEEP- -SDLPDAQKSSETTGTTETAEA 894 

Query: 861 EEVPTVDPVQE KVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEA 916 

E + E + AK AE+ +L + N + IE + E + 4- + E 

Sbjct : 895 EHTEVTNVPSEVETSETAKPAETTADQLTGEI -NQEKQIEQPSEASESVPEEKTEV- - EQ 951 

Query : 917 PQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNV- 975 

P KP+E T4- + + -H + + + + +PVK +ST+ +P N+ 

Sbjct : 952 P KPAEEDSKGNDTPQEEVSISNTETTMADESSTQPVK- -DSTETNQNDPADNMT 1003 

Query: 976 GSDPMLDPA 984 

G D +D A 
Sbjct: 1004 EESHGQDSTIDEA 1016 



tr Q6CTI0 Similar to sp|Q05050 Saccharomyces cerevisiae YMR031c 83 0 

[KLLA0C12573g] AA 
[Kluyveromyces lactis NRRL Y-1140] align 



Score 


= 40 


.4 bits (93), Expect = 0.18 




Identities 


= 48/219 (21%), Positives = 83/219 (36%), Gaps = 17/219 (7%) 




Query: 


799 


VEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 


858 






+EV + E QT+ + +KRK+ +EVEK +K+++EK 4-TS 




Sbjct : 


593 


LEVDAIINERQTELSNTEIEVKREKLKLIDAMKE-VAEVKNEDKIDEEKAKAFLGTTSGE 


651 


Query: 


859 


TLEEVPTVDPVQ- -EKVAKFAE- - SYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTG 


914 






L V+P + K +E S K+E V N+ + P+ + K ++ T 




Sbjct : 


652 


FLASQKKVEPATKLQSDPKLSEPSSKSTKIEGVTGNVKADVPASPPAHK- -KHSIGGLTS 


709 


Query: 


915 


EAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGN 


974 






+K + + +NK + P + KP N + E 




Sbjct: 


710 


PLKSKKKSDKDQKGSSIKKFFGLKPSDQNKNTKTTQPTPLKSSPKPSNKPVTATVTTEKK 


769 


Query: 


975 


VGSDPMLDPALEEAPAVDP VQEKLEKFTAS 1004 








+P A E P+ + + P V K+E + AS 




Sbjct : 


770 


ENVEPK-STATETKPSLEPSFSGFSQGSVHNKVEQSDAS 8 07 





tr Q8 79S6 Hemagglutinin-like secreted protein [pspA] [Xylella 3457 
fastidiosa AA 
(strain Temeculal / ATCC 700964)] align 

Score = 40.0 bits (92), Expect = 0.23 

Identities = 28/69 (40%), Positives = 33/69 (47%), Gaps = 6/69 (8%) 

Query: 142 AHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAY 201 

AHA +V T+ R Q H + N K N R Y +D Y NPADI+EDT Y 

Sbjct: 1594 AHAPDWTEA RMYQPHWRKN - KPNGGSGDFRLS SNYDAHD I Y YLNPAD ILEDT - - P Y 1647 

Query: 2 02 IVPHGGHYH 210 

IPG H 
Sbjct: 1648 ITPDGQKIH 1656 
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tr Q8IJI0 Pre-mRNA splicing factor, putative [PF10_0217] [Plasmodium 538 AA 
falciparum (isolate 3D7)] 

align 

Score = 40.0 bits (92), Expect = 0.23 

Identities = 36/157 (22%) , Positives = 72/157 (44%) , Gaps = 7/157 (4%) 

Query: 807 ENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGN- -STSNSTLEEVP 864 

E+++++ Li + + N + N+ D K E +K K+K S + + S S+ E+ 

Sbjct: 346 EDRSERRRSLSEDRSNSRKRNASSDFKRELNSDDDKKSKKKRSYSASPGSAYKSSSRELK 405 

Query: 865 TVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENK 924 

+ D + + + + + +SY + +V +N + E E +K G A + G+NK 

Sbjct: 406 SQDKSNDRLSESKKSYKSQSASVRYN STEEKKSDDETVKPKRGG - RGNA- KNTGKNK 460 

Query: 92 5 PS ENGKVS TGT VENQPTENKP AD S L PE APNE KP VKPE 961 

+ + K T EM +++ D +E+ KPE 

Sbjct: 461 NTKGSKKGAKTEENHDSKSVSNDRNKSDKSEEKEKPE 497 



tr 096609 Surface antigen ariell [Entamoeba histolytica] 215 AA 

align 

Score =40.0 bits (92), Expect = 0.23 

Identities = 42/163 (25%), Positives = 64/163 (38%), Gaps = 18/163 (11%) 

Query: 807 ENQTDKPS ILPQFKRNKAQENSKLDEKV- - EEPKTSEKVEKEKLSETGNSTSNSTLEEVP 864 

EN+ DK K+ NS+LDE + ++S E N +S++ E 

Sbjct: 45 ENEEDK KS S SNSELDENSNNQPDE S SNNKPNES SDNKPNES SDNKPNES S 94 

Query: 865 TVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENK 924 

P + K ES K N P+ E ++ + P + NK 

Sbjct : 95 NNKPSESSNNKPDES SNNKPNES SDNKPNESSNNKPN-ESSNNKPSESSNNKPDESSNNK 153 

Query: 925 PSENGKVSTGTVENQPTENKPADSLPEAPNEKP-VKPENSTDN 966 

P+E S+ N+ + NKP +S PNE KP -fS+DN 

Sbjct: 154 PNE S SDNKPNE S SNNKPNES SDNKPNES SNNKPGS S SDN 192 



tr 017102 Hypothetical protein F42G2.6 [F42G2.6] [Caenorhabditis 733 
elegans] AA 

align 

Score =40.0 bits (92), Expect = 0.23 

Identities = 21/91 (23%) , Positives = 45/91 (49%) 

Query: 806 KENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPT 865 

KE T KP + + +K ++N L + + K +E EK ++ T N+ E++ 

Sbjct: 578 KEETTTKPEYRKKLRMSKRKQNELLQKLWLKEKNTENTHAEKTNQEKKLTENAVQEQLEQ 637 

Query: 866 VDPVQEKVAKFAESYGMKLENVLFNMDGTIE 8 96 

+ + + + 4- A+S + KLE M + + 

Sbjct: 638 MMKMIQEERRAADSKNLKLEQMVLEMKDKLD 668 
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sp_vs Q28 8 2Q- Splice isoform Skeletal 2 of Q28820 [TRDN] [Oryctolagus 689 
,5 cuniculus AA 

(Rabbit)] align 

Score =40.0 bits (92), Expect =0.23 

Identities = 40/187 (21%), Positives = 69/187 (36%), Gaps = 12/187 (6%) 

Query: 8 05 EKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVP 8 64 

++ + + K + P ++ K + + K EK EEP S K E ++ ++ ST + VP 

Sbjct: 374 DEKKEDSKKAKKPAEEQPKGKKQEK-KEKHEEPAKSTKKEHAEIERKEEVSAASTKKAVP 432 

Query: 865 TVDPVQEKVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENK 924 

+EK KEK GILE+K+ + E K 

Sbjct: 433 AKK- -EEKTTKTVEQETRK EKPGKISSVLKDKELTKEKEVKVPASLKEKGSETK 484 

Query: 925 PSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPMLDPA 984 

E ++ E KP + P + +K E + ++ PE + 
Sbjct: 485 KDEKTSKPEPQIKK EEKPGKEVKPKPPQPQIKKEEKPEQDIMKPEKTALHGKPEEKV 541 

Query: 985 LEEAPAV 991 

L++ AV 
Sbjct: 542 LKQVKAV 54 8 



Score =35.8 bits (81), Expect =4.4 

Identities = 22/82 (26%) , Positives = 40/82 (47%) , Gaps = 2/82 (2%) 

Query: 7 99 VEVPILEKENQTDKPSILPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNS 85 8 

+ ++ KE + +K + + K K + + EK E+P+T 4- + EK + T 
Sbjct: 158 IPTKVVHKEKEKEKEKVKEKEKPEKKATHKEKLEKKEKPETKTVTKEEKKARTKEKIEEK 217 

Query: 859 TLEEVPTV- -DPVQEKVAKFAE 878 

T +EV V + V++ VAK E 
Sbjct: 218 TKKEVKGVKQEKVKQTVAKAKE 23 9 



S P P19814 Trans-Golgi network integral membrane protein TGN38 357 
TGN3_RAT precursor [Ttgnl] AA 

[Rattus norvegicus (Rat) ] align 



Score 


= 39, 


. 7 bits (91) , 


Expect = 0.3 0 




Identities 


= 56/206 (27% 


), Positives = 84/206 (40%), Gaps = 42/206 (20%) 




Query : 


824 


AQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVA-KFAESYGM 


882 






A+ + VE 


+ +K ET + + TL+ + +- VDP QEK KF 4- G 




Sbjct: 


73 


ARTSASVSSGVESATNLNLDDSKKHPETADAKLKETLQQLLPVDPKQEKSGQKFTKDSGS 


132 


Query: 


883 


KLENVL FNMDGT IELYLPSGEV I KKNMAD FTGEAPQGNGENKP - - S ENGKVS TGTVENQP 


940 






N G 


G+ K D + G NKP S+N K TG N+P 




Sbjct: 


133 


PTGGDSDNTTG-- 


GDSNKTTGVD - - SDKTSGGDSNKPTGSDNDK- PTGGDSNKP 


181 


Query : 


941 


TENKPADSLPEAP 


NEKPVKPE - - NS TDNG MLNPE - GNVGSDPML 


981 






T P + + + E P 


EK KP + T++G L PE G+ S+P 




Sbjct : 


182 


TSKVPSNT- -ETPKIDKVQLTEKGQKPTLISKTESGEKLAGDSDFSLKPEKGDKSSEPTE 


239 
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Query: 982 DPALEEAPAVD PVQEKLEK 10 0 0 

D +E D P4-+E+ EK 

Sbjct: 240 DVETKEIEEGDTEPEEGSPLEEENEK 265 



sp 051498 DNA polymerase I (EC 2.7.7.7) (POL I) [polA] [Borrelia 908 

DP01_BORBU burgdorferi AA i 

(Lyme disease spirochete) ] ^1^35: 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 56/274 (20%), Positives = 114/274 (41%), Gaps = 11/274 (4%) 

Query: 63 8 NMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDG 697 

3ST L +I + PD +L+ +S V + N + E+D Y+ + + KDY + D 

Sbjct: 126 NYLTYIISPDKDLLQTMSEYVKILKIENNSFIEMDNEYVTKKFGVNSFQIKDYLAIVGDR 185 

Query: 698 TFTVP- -TSLAYKMASQTIFYPFHAGDTYLRVNPQFAVPKGTDALVRVFDEFHGNAYLEN 755 

+ + p + F D + N+ K + L++ + NA+L 
Sbjct: 186 SDNIPGIKGIGAKGAA-NLLREFKTLDG- IYSNLEIINKKHRELLIKEKE NAFLSY 239 

Query: 756 NYKVGE I KLP I PKLNQGTTRTAGNKI PVTFMANAYLDNQSTY I VE VP ILEKENQTDKPS I 815 

E L IP++ + +1 F + + + TY ++ EKEN K 

Sbjct: 240 ELVSLEENLKIPEIENFALKNFSEEIISLFEKHSAIALIKTYKKDILKQEKENADQKSLF 299 

Query: 816 LPQFKRNKAQENSKLD - EKVEEPKTSEKVEKEKLSETGNSTSNSTLE - EVPTVDPVQEKV 873 

+ n + + +D E V+ + K+E + L E + + + + E + +D K+ 

Sbjct: 300 KQEPTTNSLDDINTIDTENVKYRSITTKIELDDLIESLKKAKYISIDTETSSLDTYTAKL 359 

Query: 874 AKFAESYGMKLENVLFNMDGTIELYLPSGEVIKK 907 

-f S+ + E ++ ++Y+ + 

Sbjct: 360 IGISISF-KEFEGYYIPIEAKGKIYIEKNYIIQK 392 



S P P42759 Dehydrin ERD10 (Low- temperature- induced protein LTI45) 260 

DH10_ARATH [ERD10] AA _ 

[Arabidopsis thaliana (Mouse-ear cress)] align 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 40/162 (24%), Positives = 76/162 (46%), Gaps = 9/162 (5%) 

Query: 7 92 DNQSTYIVEVP ILEKENQTDKPS I LPQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSET 851 

+ + + t + +E + +KPS+L + R+ + +S DE+ E+ + +K +K+K+ E 

Sbjct : 74 EHKPTLLEQLHQKHEEEEENKPSLLDKLHRSNSSSSSSSDEEGEDGEKKKKEKKKKIVE- 132 

Query: 852 GNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDG- -TIELYLPSGEVIKKNM 909 

G+ E +D ++EK EG V+ M ++E + P E KK 

Sbjct: 133 GDHVKTVEEENQGVMDRIKEKF-PLGEKPGGDDVPWTTMPAPHSVEDHKPEEEE-KKGF 190 

Query: 910 ADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPE 951 

D E G+ + KP ++ V+T + TP +PE 
Sbjct: 191 MDKIKEKLPGHSK- KPEDSQWNTTPLVETAT PIADIPE 228 
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sp P13952 G2/mitotic-specif ic cyclin B [Spisula solidissima 428 

CG2B_SPISO (Atlantic AA 

surf -clam)] align 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 41/140 (29%), Positives = 58/140 (41%), Gaps = 11/140 (7%) 

Query: 83 7 PKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMDGTIE 8 96 

P TS L + GN S T+ +VP DP+ +K S+ K +L T 
Sbjct: 32 PHTSHASQRNTLGD IGNQVS AIT I SDVPRKDP 1 1 KKE I VHLS SHQHK ILTKSKATTS 88 

Query: 897 L-YLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPAD SLPEA 952 

L L IK FT P KP+ V T TV QPT P D + + PE + 
Sbjct : 89 LKSLAEESHIPKKQEAFTFLEPVA-AMPKPT TVPTATVLPQPTVPVPMDISENVPES 144 

Query: 953 PNEKPVKPENSTDNGMLNPE 972 

+ + +N N NP+ 
Sbjct: 145 FSRVLLNVQNIDANDKENPQ 164 



tr Q63575 Trans golgi network (TGN) specific integral membrane protein 380 
TGN38 AA 
precursor [tgn41] [Rattus norvegicus (Rat)] align 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 56/206 (27%), Positives = 84/206 (40%), Gaps =42/206 (20%) 

Query : 824 AQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVA-KFAESYGM 882 

A+ ++ + VE + +K ET ++ TL+ + + VDP QEK KF + G 

Sbjct : 73 ARTSASVSSGVESATNLNLDDSKKHPETADAKLKETLQQLLPVDPKQEKSGQKFTKDSGS 132 

Query : 8 83 KLEMVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKP - - SENGKVSTGTVENQP 94 0 

N G G+ K D + G NKP S+N K TG N+P 

Sbjct : 133 PTGGDSDNTTG GDSNKTTGVD- -SDKTSGGDSNKPTGSDNDK-PTGGDSNKP 181 

Query: 941 TENKPADSLPEAP NEKPVKPE- -NSTDNG MLNPE-GNVGSDPML 981 

T P+ + + E P EK KP 4- T++G L PE G+ S + P 

Sbjct : 182 TSKVPSNT- -ETPKIDKVQLTEKGQKPTLISKTESGEKLAGDSDFSLKPEKGDKSSEPTE 239 



Query: 982 DPALEEAPAVD PVQEKLEK 1000 

D +E D P++E+ EK 

Sbjct: 240 DVETKEIEEGDTEPEEGSPLEEENEK 265 



tr Q9KK42 Surface protein PspC [pspC] [Streptococcus 699 AA 

pneumoniae] align 

Score = 39.7 bits (91), Expect =0.30 

Identities = 48/228 (21%), Positives = 94/228 (41%), Gaps = 21/228 (9%) 

Query: 786 MANAYLDNQSTYIVEVPILEKENQTDKPS ILPQFKRN KAQENSKLDE 832 

+ N D + TY+ E+ +LE +++ + PS QFK++ ++ ++ + 

Sbjct: 89 LINKLQDIKRTYLYELNVLEDKSKAELPSKIKAELDAAFEQFKKDTLPTEPGKKVAEAKK 148 

Query : 833 KVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESYGMKLENVLFNMD 892 
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KVEE + K +KE + + + TLE V+ K AE +K E + 
Sbjct: 149 KVEEAEKKAKAQKEEDYRNYPTITYKTLELEIAESDVK VKEAELELVKKEADESRNE 205 

Query: 893 GTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKVSTGTVENQPTENKPADSLPEA 952 

GTI +++A+T KEK E++ + + + 

Sbjct: 206 GTIN QAKAKVESEQAEATRLKKIKTDREKAEEEAKRRADAKEQDESKRRKS -RVKRG 261 

Query: 953 PNEKPVKPENSTDNGMLNPEGNVGSDPMLDPALEEAPAVDPVQEKLEK 1000 

+ P P+ DN + + +VG + 4- P+L+ V 4-+K+E + 
Sbjct: 262 DLGEPATPDKK-DNDAKSSDSSVGEETLPSPSLKPGKKVADAEKKVEE 308 



tr Q7P6T9 TonB protein [FNV16 02] [Fusobacterium nucleatum subsp. 2 54 

vincentii AA 
ATCC 4 92 56] ali gn 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 10/77 (12%) 

Query: 807 ENQTDKPSILPQFKR- -NKAQENSKLDEKV EEPKTSEKVEKEKLSETGNSTSNSTLE 861 

E DKP P+ +R KA-f EN K V E + PKT+ K EK L+ + SNS + 

Sbjct : 64 EKPEDKPIEKPEKERPEKKAEENKKAEKTVQVEEKPKTTPKKEKPSLADLKKQISNSQPK 123 

Query: 862 E VPTVDPVQEKV 873 

PT DP E+V 
Sbjct: 124 TSNGGFSPTADPDGEEV 140 



tr Q8H7D1 Hypothetical protein [Arabidopsis thaliana (Mouse-ear cress)] 346 AA 

align 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 68/285 (23%), Positives = 107/285 (36%), Gaps = 35/285 (12%) 

Query: 714 IFYPFHAGDTYLRVNPQFAVPKGTDA LVRVFDEFHGNAYL -ENNYKVGEIKLPIP 767 

+ FYP AGT + FPG+ L + GN+E + GE 

Sbjct: 21 LFYPKRAG - TPRKTE I VFVAPTGEE I SSRKQLEQYLKAHPGNPVI SEFEWTTGETPRRS S 79 

Query: 768 KLNQGTTRTAGNKI PVTFMANAYL DNQSTYIVEVPILEKENQ-TDKPSILPQFKRNK 823 

+++Q T +K P+ + L DN+ KEN DK + K 

Sbjct: 80 RISQKVKATP-DKEPLLKKRRSSLTKKDNKEAAEKNEEAAVKENMDVDKDGKTENAEAEK 138 

Query: 824 AQENSKLDEKVEEPKTS EKVEKEKLSETGNSTSNSTLEEVPTVDPVQEKVAKFAESY 880 

+E + E E K + EK E EK+ + + G T + + 4-EK + AE + 

Sbjct: 13 9 EKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGKEGQTEIAEAEKEKEGEKAEAE 198 

Query: 881 GMKLENVLFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSE NGKVST 933 

4-EV D + + + E+KK AGG +PS+ K + 

Sbjct: 199 NKEAEW RDKKESMEVDTSELEKK AGSGEGAEEPSKVEGLKDTEMKEAQ 247 

Query: 934 GTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSD 978 

V E KPA+ E N+ V E + + + E N+ SD 

Sbjct : 248 EWTEADVEKKPAEEKTE - -NKGSVTTEANGEQNVTLGEPNLDSD 290 



http://us.expasy.org/cgi-bin/blast.pl 



9/27/04 



ExPASy BLAST2 Interface 



Page 116 of 116 



tr Q28687 Neurof ilament-H (Fragment) [Oryctolagus cuniculus (Rabbit)] 606 AA 

al ign 

Score = 39.7 bits (91), Expect = 0.30 

Identities = 44/196 (22%), Positives = 80/196 (40%), Gaps = 16/196 (8%) 

PQFKRNKAQENSKLDEKVEEPKTSEKVEKEKLSETGNSTSNSTLEEVPTVD PVQEK 872 

p + + + +e +K EK + P+ ++ KE+ +++ + EE + + PV+E+ 

PEKAKS PVKEEAKS PEKAKS PE KAKS PVKEE - AKS PEKAKS P VKEEAKS PE KAKS PVKEE 413 

VAKFAESYGMKLENV-LFNMDGTIELYLPSGEVIKKNMADFTGEAPQGNGENKPSENGKV 931 

AK E K E V + +E P+ ++K+ EAP + E K 



STGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPML DPA 984 

ST + E+K + + ++ KP+ T+ PE +P P 



EE PA +E E+ 



Query: 


817 


Sbjct : 


355 


Query : 


873 


Sbjct : 


414 


Query: 


932 


Sbjct: 


471 


Query: 


985 


Sbjct: 


531 



Database: EXPASY/UniProt 

Posted date: Sep 26, 2004 5:48 PM 
Number of letters in database: 502,311,625 
Number of sequences in database: 1,569,420 

Lambda K H 

0.309 0.131 0.365 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

length of query: 1039 

length of database: 502,311,625 

effective HSP length: 134 

effective length of query: 905 

effective length of database: 292,009,345 

effective search space: 264268457225 

effective search space used: 264268457225 



T: 


11 








A: 


40 








XI 


16 


( 7 


1 


bits) 


X2 


38 


(14 


6 


bits) 


X3 


64 


(24 


7 


bits) 


SI 


42 


(21 


7 


bits) 


S2 


78 


(34 


7 


bits) 
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DB=PGPB,USPT,USOC,EPAB,JPAB,DWPI,TDBD; PLUR=YES; OP=AND 

□ LI (sillery or brodeur or rioux or bizard or charland).in. 828 
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END OF SEARCH HISTORY 



http ://westbrs : 9000/bin/cgi-birfe 



9/27/04 



Record Display Form 



Page 1 of 1 



Sep 2, 2004 



First Hit 



L3 : Entry 1 of 57 File: PGPB 

PG PUB -DOCUMENT -NUMBER : 20040171802 
PGPUB- FILING- TYPE : new 

DOCUMENT- IDENTIFIER: US 20040171802 Al 

TITLE: Haemophilus influenzae antigens and corresponding dna fragments 
PUBLICATION-DATE: September 2, 2004 
INVENTOR- INFORMATION : 

NAME CITY STATE COUNTRY RULE -47 

H amel, Josee 

Couture, France 

B rodeur, Bernard R. 

Martin, Denis 

Quel let, Catherine 

Tremblay, Mireille 

Charbonneau, Annie 

Vayssier, Catherine 

APPL-NO: 10/ 398186 [PALM] 
DATE FILED: April 2, 2 003 

RE LATED - US - AP PL - DATA : 

Application is a non-provisional -of -provisional application 60/236712 filed 
October 2, 2000, 

PCT-DATA: 

DATE-FILED APPL-NO PUB-NO PUB-DATE 371-DATE 102 (E) -DATE 

Oct 2, 2001 PCT/CA01/01402 

INT-CL: [07] C07 K 14/195, C12 N 1/21, CCT7 H 21/04 

US-CL-PUBLISHED: 530/350; 435/069.3, 435/320.1, 435/252.3, 536/023.7 
US -CL- CURRENT: 530/350; 435/ 252.3 , 435 / 320.1 , 435/ 69.3 , 536/23.7 

REPRESENTATIVE - FIGURES : NONE 



ABSTRACT: 

The present invention relates to polypeptides of Haemophilus influenzae which may 
be used for prophylaxis, diagnostic and/or therapy purposes. 



Sillery 


CA 
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CA 
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Search Results - Record(s) 1 through 50 of 57 returned. 



r 1. 20040171802. 02 Apr 03. 02 Sep 04. Haemophilus influenzae antigens and corresponding dna 
fragments. Hamel, Josee, et al. 530/350; 435/252.3 435/320.1 435/69.3 536/23.7 C07K014/195 
C12N001/21 C07H021/04. 



□ 2. 20040171113. 19 Apr 04. 02 Sep 04. Antigens of group b streptococcus and corresponding dna 
fragments. Martin, Denis, et al. 435/69.1; 435/252.3 435/320.1 530/350 536/23.7 C07H021/04 
C12N001/21 C07K014/315. 



f~l 3. 20040132652 . 28 Aug 03. 08 Jul 04. Pharmaceutical compositions. Martin, Denis, et al 514/12- 
424/450 A61K009/127 A61K038/17. 



D 4. 20040122509. 20 Dec 02. 24 Jun 04. Radiopaque ePTFE medical devices. Brodeur. Christophi 
Brian. 623/1.34; 600/431 623/901 A61F002/06. 



□ 5. 20040097706. 1 8 Nov 03. 20 May 04. Streptoco ccus pyogenes antigens and corresponding dna 
fragments. Martin, Denis, et al. 530/350; 424/190.1 C07K014/315 C12Q001/68 A61K039/02. 



F~ 6. 20040071730. 26 Sep 03. 15 Apr 04. Bvh-a2 and bvh-a3 antigens of group b streptococcus . 
Martin, Denis, et al. 424/190.1; 435/252.3 435/320.1 435/6 435/69.1 435/7.32 530/350 C07K014/315 
C12N015/74 G01N033/554 G01N033/569 C12Q001/68 A61K039/02 



□ 7. 20040029248. 16 Jun 03. 12 Feb 04. CHD5 encoding nucleic acids, polypeptides, antibodies 
and methods of use thereof. Brodeur. Garrett M., et al. 435/199; 435/320.1 435/325 435/6 435/69 1 
536/23.2 C12N009/22 C12Q001/68 C07H021/04 C12P021/02 C12N005/06. 



r 8 - 20030232976. 20 Dec 02. 18 Dec 03. Streptoc occus antigens. Hamel. Josee, et al. 536/23 1 • 
C07H021/02 C07H021/04. 



(J 9. 2 0030228323 . 13 Jan 03. 1 1 Dec 03. Novel group B stre ptococcus antigens. Br odeur . Bernard 
R.,etal. 424/184.1; A61K039/00 A61K039/38. 



Ij 10. 20030198943. 23 Jan 02. 23 Oct 03. Identification of a large number of biological (micro) 
organisms groups at different levels by their detection on a same array. Remade, Jose et al 435/5- 
435/287.2 435/6 435/7.22 435/7.32 C12Q001/70 C12Q001/68 G01N033/53 G01N033/569 
GO1N033/554 C12M001/34. 



Ij 11. 20030165528. 22 Apr 03. 04 Sep 03. Streptococcus pyogenes antigens. Martin, Denis, et al 
424/190.1; 435/252.3 435/320.1 435/69.3 530/350 536/23.7 A61K039/02 C07H021/04 C07K014/195 
C12P021/02 C12N001/21 C12N015/74. 



Ij 12. 20030124.522. 19 May 00. 03 Jul 03. Method for the identification and/or the quantification of 
a target compound obtained from a biological sample upon chips. Remacle, Jose, et al 435/6- 435/7 9 
C12Q001/68 G01N033/53 G01N033/542. 



H 13. 20030077293. 20 Jun 01. 24 Apr 03. Stre ptococ cus antigens. Hamel. Josee et al 424/190 1 • 
435/183 435/252.3 435/320.1 435/69.3 536/23.7 A61K039/02 C07H021/04 C12N009/00 C12P021/02 
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C12N001/21 C12N015/74. 



f! 14. 20030059896. 20 Dec 01 . 27 Mar 03. Novel chlamydia antigens and corresponding DNA 
fragments. Couture, France, et al. 435/69.3; 435/252.3 435/320.1 435/7.36 530/350 536/23.7 
G01N033/571 C07H02 1/04 CI 2P02 1/02 C12N001/21 C07K014/295 C12N015/74. 

r 15. 20030049271. 21 Feb 02. 13 Mar 03. Streptococcus pyogenes polypeptides and corresponding 
DNA fragments. Martin, Denis, et al. 424/190.1; 435/252.3 435/320.1 435/69.3 435/7.32 530/350 
536/23.7 A61K039/02 G01N033/554 G01N033/569 C07K014/315 C07H021/04 C12P021/02 
C12N001/21. 

□ 16. 2 0030031682 . 1 8 Feb 99. 1 3 Feb 03 . NOVEL GROUP B STR^TOCOCCus ANTIGENS 
BRODEUR. BERNARD R., et al. 424/190.1; A61K039/02. 

□ 17. 20030007981. 13 Mar 02. 09 Jan 03. Cloning and expression of Haemophilus somnus 
transferrin-binding proteins. Potter, Andrew A., et al. 424/190.1; 424/256.1 435/252.3 435/320 1 
435/69.3 530/350 536/23.7 A61K039/02 A61K039/102 C07H021/04 C07K014/195. 

T 18. 200201 06646 . 23 Mar 01 . 08 Aug 02. Identification of biological (micro) organisms by 
detection of their homologous nucleotide sequences on arrays. Remacle, Jose, et al. 435/6; 435/287 2 
435/91.2 C12Q001/68 C12P019/34 C12M001/34. 

□ 19. 6743579. 20 Oct 00; 01 Jun 04. Glycerol as a predictor of glucose tolerance. Gaudet; Daniel et 
al. 435/6;. C12Q001/68. 

□ 20. 6583333. 12 May 98; 24 Jun 03. Lymphoma-susceptible transgenic mice and methods for 
studying drug sensitivity of lymphomas. Lowe; Scott W., et al. 800/3; 424/9.1 424/9.2 435/325 435/352 
435/354 435/355 800/13 800/14 800/18. G01N033/00 A01K067/033 A01K067/027 C12N005/06 
A61K049/00. 

IT 21. 6391316 . 24 Sep 99; 21 May 02. Vaccine compositions comprising Haemophilus somnus 
transferrin-binding proteins and methods of use. Potter; Andrew A., et al. 424/256.1- 424/185 1 
424/190.1 424/193.1 530/350. A61K039/102. 

r 22. 6287574. 13 Nov 97; 1 1 Sep 01. Proteinase K resistant surface protein of neisseria 
meningitidis. Brodeur ; Bernard R., et al. 424/250.1; 424/184.1 424/185.1 424/190.1 424/249 1 530/300 
530/350 536/23.7. A61K039/095. 

□ 23. 6242229. 05 May 98; 05 Jun 01. Cosmetic/pharmaceutical compositions comprising 
microorganism culture media. Pineau : Nathalie, et al. 435/170; 435/261. A61K035/66. 

I~ 24. 6100066. 15 Mar 96; 08 Aug 00. Nucleic acid molecules encoding Haemophilus somnus 
proteins. Potter; Andrew A., et al. 435/69.3; 424/256.1 435/252.3 435/320.1 435/69.1 536/23 1 536/23 7 
536/24.32. C12P021/06. 

T 25. 5919620 07 Jun 95; 06 Jul 99. Heat shock protein HSP72 of Streptococcu s pneumoniae. 
Brodeur; Bernard R., et al. 435/6; 435/4 435/69.1 435/963 536/23.4 536/23.7. C12O001/06 C12Q001/68 
C12P021/06 G01N033/53. 

C 26. 5 837204 . 18 Apr 97; 17 Nov 98. System for disinfecting the water lines of a dental unit. 
Prevost; Andre, et al. 422/105; 134/1 66C 134/169C 137/511 210/136 422/117 422/28 433/80. 
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□ 27. 57804 24. 30 Sep 96; 14 Jul 98. Purified ribosomal fractions separated from the 
nonphotosynthetic filamentous bacteria beggiatoales. Pineau; Nathalie, et al. 514/2; 424/234.1 424/282 1 
424/489 424/490 435/820 514/44 514/885 514/886 514/887. A61K038/00 A61K031/70 A61K039/02 
A61K009/14. 



□ 28. 5710170. 10 Dec 96; 20 Jan 98. Tri-aryl ethane derivatives as PDE IV inhibitors. Guay; 
Daniel, et al. 514/332; 514/341 514/365 546/266 546/272.7 548/203. C07D213/30 C07D233/64 
A61K031/44 A61K031/415. 



□ 29. 5554530. 04 Aug 94; 10 Sep 96. Aseptic in vitro endomycorrhizal spore mass production. 
Fortin; J. Andre, et al. 435/256.8; 435/252.5 435/254.1 435/254.11 435/911 47/1.1. C12N001/14 
C12N001/04 A01H015/00 A01H017/00. 



IJ 30. 5455266. 10 Sep 93; 03 Oct 95. Enhanced chemotherapeutic compositions against microbial 
infections in fish, the preparation and use thereof. Kusuda; Riichi, et al. 514/450; 514/649 514/653 
514/659. A61K03 1/135 A61K031/335. 



□ 31. 5013664- 28 May 86; 07 May 91 . Common protein of Haemophilus influenzae type b 
identified by a monoclonal antibody. Brodeur : Bernard R., et al. 435/7.32; 435/340 435/70.21 435/810 
435/851 435/948 436/548 530/350 530/388.4. C12N005/12 C07K015/78 G01N033/53 G01N033/535 



t~ 32. WO200400772 5A1. 15 Jul 03. 22 Jan 04. POLYPEPTIDE OF STREPTOCO CCUS 
PYOGENES. RIOUX, STEPHANE, et al. C12N015/31; C12N015/62 C12N015/63C07K014/315 
C12N005/10 G01N033/53 A61K038/16. 



C 33. WO0030 68813A2 . 1 1 Feb 03. 21 Aug 03. GROUP B mEEPTOCOCCUS ANTIGENS 
RIOUX, STEPHANE, et al. C07K014/315;. 



r~ 34. WO003054007A2. 20 Dec 02. 03 Jul 03. STREPTOCOCCI IS ANTIGENS. HAMEL JOSEE 
etal. C07K014/315;. 



H 35. WO003004650A2. 05 Jul 02. 16 Jan 03. GROUP B STREPTOCOCCUS ANTIGENS AND 
CORRESPONDING DNA FRAGMENTS. MARTIN, DENIS, et al. C12N015/31; C12N015/63 
C07K014/315 C07K019/00 A61K039/09 G01N033/569. 



H 36. WO002088178A2. 02 May 02. 07 Nov 02. ANTIGENS OF GROUP B STREPTOCOCCUS 
AND CORRESPONDING DNA FRAGMENTS. MARTIN, DENIS, et al. C07K014/315;. 



□ 37. WO002079475A2. 28 Mar 02. 10 Oct 02. STREPTOCOCCUS PYOGENES ANTIGENS 
AND CORRESPONDING DNA FRAGMENTS. MARTIN, DENIS, et al. C12N015/3L C12N015/63 
C07K014/315 C12N015/62 A61K039/09 C07K016/12 G01N033/53. 



T 38. WO002066650A2. 2 1 Feb 02. 29 Aug 02. <u>STREPTOCOCCUS PYOGENES</u> 
POLYPEPTIDES AND CORRESPONDING DNA FRAGMENTS. MARTIN, DENIS et al 
C12N015/31; C12N015/63 C07K014/315 C07K019/00 A61K039/09 A61P017/00 A61P031/00 
G01N033/569. 



T 39. WO00994258 _8A2. 1 7 Feb 99. 26 Aug 99. GROUP B STOEJiTQCQCCUS ANTIGENS 
BRODEUR, BERNARD R, et al. C12N015/31; C07K014/315 A61K039/09 C12N001/21. 
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T 40. WO009640928AL 17 May 96. 19 Dec 96. STREPTOCOCCA L HEAT SHOCK PROTEINS 
MEMBERS OF THE HSP70 FAMILY. HAMEL, JOSEE, et al. C12N0 15/31; C07K014/315 
C12NQ01/21 C07K016/12 A61K039/09 A61K039/395 G01N033/569 C12Q001/68 



P 4 1 • WO20Q4048575A. New Streptococcus pneumoniae polynucleotides and polypeptides, useful 
in preventing or treating otitis media, sinusitis, pneumonia, meningitis, bacteremia, or septicaemia 
BLAIS, N, et al. A61K038/16 A61K039/09 C07K014/315 C12N005/10 C12N015/31 C12N015/62 
G01N033/53. 



L 42. WO2004007725A. New polypeptide of Streptococcus pyogenes, useful for the therapeutic or 
prophylactic treatment of infections, including pharyngitis, erysipelas, impetigo, scarlet fever, 
bacteremia and necrotizing fascitis. BRODEUR. B R, et al. A61K038/16 C07K014/315 C12N005/10 
C12N015/31 C12N015/62 C12N015/63 G01N033/53. 



□ 43 . WO2003068813A. New Sip polynucleotide, useful for preparing a composition for 
diagnosing, treating or preventing Streptococcus infection e.g. sepsis, meningitis, pneumonia, septic 
arthritis, endocarditis or epiglottis. B RODEUR. B R, et al. C07K0 14/315. 



□ 44. WO20Q3054007A- New isolated polypeptides of S treptococcus pneumoniae, useful for 
diagnosing, preventing or treating streptococcal infection, meningitis, otitis media, bacteremia or 
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L3: Entry 25 of 57 File: USPT Jul 6, 1999 

US-PAT-NO: 5919620 

DOCUMENT -IDENTIFIER: US 5919620 A 

TITLE: Heat shock protein HSP72 of Streptococc us pneumoniae 
DATE-ISSUED: July 6, 1999 
INVENTOR- INFORMATION : 

NAME CITY STATE ZIP CODE COUNTRY 

B rodeur ; Bernard R. Sillery CA 

Martin; Denis St.-Augustin CA 

Hame 1 ; Josee Sillery CA 

US -CL- CURRENT: 435/6; 435/4, 435 / 69.1 , 435/ 963 , 536 / 23.4 , 536/ 23.7 
CLAIMS : 

We claim: 

1. An isolated and purified S treptococcus pneumoniae heat shock protein having 
a molecular weight of about 72 kDa as measured by SDS-PAGE. 

2. An isolated and purified protein according to claim 1, comprising amino acid 
acid sequence SEQ ID NO : 5 . 

3. An isolated and purified protein according to claim 2, consisting of amino 
acid sequence SEQ ID NO : 5 . 

4. An isolated and purified immunogenic polypeptide comprising an amino acid 
sequence chosen from the group consisting of: 

SEQ ID NO: 5; 

residues 527 to 541 of SEQ ID NO : 5 or an immunogenic fragment thereof; 

residues 510 to 607 of SEQ ID NO : 5 or an immunogenic fragment thereof; 

residues 439 to 607 of SEQ ID NO : 5 or an immunogenic fragment thereof; 

residues 586 to 600 of SEQ ID NO : 5 or an immunogenic fragment thereof; and 

residues 244 to 330 of SEQ ID NO : 5 or an immunogenic fragment thereof; 

wherein said polypeptide generates a monoclonal antibody not reactive with E. 
coli or S. aureus. 
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et al . 1995. "Cloning of an Enterococcus faecalis endocarditis antigen: homology 
with some adhesins from oral streptococci." Infect. Immun 63:703-706) revealed 
significant sequence similarity between them. Sequence identities were 81%, 81%, 
77%, 82%, and 57%, respectively, for PsaA (S. pneumoniae strain R3 6A) , SsaB (S. 
sanguis), FimA (S. parasanguis) , ScaA (S. gordonii) and EfaA (E. faecalis). 
Additionally, all six sequences showed great similarity in organization. They have i 
a hydrophobic leader peptide containing the prolipoprotein consensus sequence LXXC 
(for signal peptidase II cleavage) within the first 17-20 amino acids. This N- ~~ 
terminal leader sequence appears to represent the area of greatest variability. It 
is followed by a region of high similarity from amino acids 36-150. The region from 
150 to 198 is a variable region and is followed by another conserved reqion (198- 
309) . 



Full | Title [ Citation [~TTont~|~l3eview JTialstfiojtion [~ Date "["Reference 



□ 2. Document ID: UjS 6582706 Bl 

Ll: Entry 2 of 5 



File: USPT 



Jun 24, 2003 



DOCUMENT- IDENTIFIER: US 6582706 Bl 

TITLE: Vaccine compositions comprising Streptococcus pneumoniae polypeptides having 
selected structural MOTIFS 



Detailed Description Text (65) : 

The genomic DNA used as target for amplification was isolated from S. pneumoniae 
Norway strain (serotype 4) , the same strain used for genomic sequencing. The 
complete sequence of the Sp36 gene (SEQ ID NO: 9), and its predicted amino acid 
sequence (SEQ ID N0:8), are given in the Sequence Listing appended hereto. It was 
noted that the predicted amino acid sequence included a hydrophobic leader sequence 
followed by a sequence (LSVCl similar to the consensus sequence for Type II signal 
^peptidase ( LxxC, in which" both x's typically represent small amino acids) . Primers 
(listed as SEQ ID NOS: 1-3) were designed that would amplify the Sp36 gene and allow 
its cloning into pQElO and expression as a histidine- tagged protein lacking the 
signal sequence for purification by nickel -affinity chromatography. Cloning of the 
fragment amplified by SEQ ID Nos 1 and 3 would result in a protein containing amino 
acids 21 through 819 of Sp3 6; cloning of the fragment amplified by SEQ ID Nos 2 and 
3 would result in a protein containing amino acids 26 through 819 of Sp36 (amino 
acid numbers refer to SEQ ID NO: 8) . 



i~| Classr 



n 3. Document ID: US 6406883 Bl 

Ll: Entry 3 of 5 File: USPT 



Jun 18, 2002 



DOCUMENT- IDENTIFIER: US 6406883 Bl 

TITLE: Lmb gene of Streptococcus agalactiae 



Detailed Description Text (25) : 
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L2 : Entry 2 of 24 



File: PGPB 



Apr 29, 2 0 04 



DOCUMENT- IDENTIFIER: US 20040081662 Al 
TITLE: Vaccine 



Summary of Invention Paragraph : 

[0010] In one aspect, the present invention is an immunogenic composition 
comprising at least 2 S. pneumoniae proteins selected from the group consisting of 
Poly Histidine Triad family (PhtX) , Choline Binding Protein family (CbpX) , CbpX 
truncates, LytX family, LytX truncates, CbpX truncate-LytX truncate chimeric 
proteins, pneumolysin (Ply), PspA, PsaA, Spl28, SplOl, Spl30, Spl25 and Spl33. In a 
preferred embodiment, one of the proteins is from the Poly Histidine Triad family 

(Phtx) . In another preferred embodiment, one of the proteins is from the Choline 
Binding Protein family (CbpX), or CbpX truncates, or CbpX truncate-LytX truncate 
chimeric proteins . 

Summary of Invention Paragraph : 

[0016] The Streptococcus pneumoniae proteins of the invention are either surface 
exposed, at least during part of the life cycle of the pneumococcus , or are 
proteins which are secreted or released by the pneumococcus. Preferably the 
combination of proteins of the invention are selected from 2 different categories 
such as proteins having a Type II Signal sequence motif of LXXC (where X is any 
amino acid, e.g., the polyhistidine t riad family (phtX) ) , choline binding proteins 
(CbpX), proteins having a Type I Signal sequence motif (e.g., SplOl) , proteins 
having a LPXTG motif (where X is any amino acid, e.g., Spl28, Spl30) , toxins (e.g., 
Ply), etc. Preferred examples within these categories (or motifs) are the following 
proteins, or immunologically functional equivalents thereof. 

Summary of Invention Paragraph : 

[0017] The immunogenic composition of the invention comprises at least 2 proteins 
selected from the group consisting of the Poly Histidine Triad family (PhtX) , 
Choline Binding Protein family (CbpX), CbpX truncates, LytX family, LytX truncates, 
CbpX truncate-LytX truncate chimeric proteins (or fusions) , pneumolysin (Ply) , 
PspA, PsaA, Spl28, SplOl, Spl30, Spl25 and Spl33 . However, if CbpX is PspC, then 
the second protein is not PspA or PsaA. Preferably, the immunogenic composition 
comprises 2 or more proteins selected from the group consisting of the Poly 
Histidine Triad family ( PhtX ) , Choline Binding Protein family (CbpX) , CbpX 
truncates, LytX family, LytX truncates, CbpX truncate-LytX truncate chimeric 
proteins (or fusions), pneumolysin (Ply), PspA, PsaA, and Spl28. More preferably, 
the immunogenic composition comprises 2 or more proteins selected from the group 
consisting of the Poly Histidine Triad family ( PhtX ) , Choline Binding Protein 
family (CbpX), CbpX truncates, LytX family, LytX truncates, CbpX truncate-LytX 
truncate chimeric proteins (or fusions) , pneumolysin (Ply) , and Spl28 

Summar y of Invention Paragraph : 

[0019] With regards to the PhtX proteins, PhtA is disclosed in WO 98/18930, and is 
also referred to Sp36. As noted above, it is a protein from the p olyh i stidine triad 
family and has the type II signal motif of LXXC. 

Summary of Invention Paragraph : 

[0020] PhtD is disclosed in WO 00/37105, and is also referred to Sp036D. As noted 
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above, it also is a protein from the polyhistidine triad family and has the type I 
LXXC signal motif. — 

Summary of Invention Paragraph : 

[0021] PhtB is disclosed in WO 00/37105, and is also referred to Sp036B. Another 
member of the PhtB family is the. C3 -Degrading Polypeptide, as disclosed in WO 
00/17370. This protein also is from the polyhistidine triad family and has the type 
II LXXC signal motif. A preferred immunologically functional equivalent is the 
protein Sp42 disclosed in WO 98/18930. A PhtB truncate (approximately 79 kD) is 
disclosed in W099/15675 which is also considered a member of the PhtX family. 

CLAIMS : 

1. An immunogenic composition comprising at least 2 S. pnumoniae proteins wherein 
one of the proteins is selected from the polyhistidine triad family (PhtX) and 
another protein is selected from the group consisting of Choline Binding Protein 
family (CbpX) , CbpX truncates, LyX family, LytX truncates, CbpX truncate-LytX 
truncate chimeric proteins, pneumolysin (Ply), PspA, PsaA, Spl28, SplOl Sd130 
Spl25 and Spl33 . 

2. An immunogenic composition comprising at least 2 S. pneumoniae proteins wherein 
one of the proteins is selected from the group consisting of Choline Binding 
Protein family (CbpX), CbpX truncates, and CbpX truncate-LytX truncate chimeric 
proteins and another protein selected from the group consisting of polyhistidine 
triad family (PhtX), LytC, pneumolysin (Ply), PsaA, and Spl28. 
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